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ISOLATED HUMAN TRANSPORTER PROTEINS, NUCLEIC ACID 
MOLECULES ENCODING HUMAN TRANSPORTER PROTEINS, 

AND USES THEREOF 

FIELD OF THE INVENTION 

The present invention is in the field of transporter proteins that are related to the 
glutamate receptor subfamily, recombinant DNA molecules, and protein production. The 
present invention specifically provides novel peptides and proteins that effect ligand 
transport and nucleic acid molecules encoding such peptide and protein molecules, all of 
which are useful in the development of human therapeutics and diagnostic compositions 
and methods. 

BACKGROUND OF THE INVENTION 

Transporters 

Transporter proteins regulate many different functions of a cell, including cell 
proliferation, differentiation, and signaling processes, by regulating the flow of molecules 
such as ions and macromolecules, into and out of cells. Transporters are found in the 
plasma membranes of virtually every cell in eukaryotic organisms. Transporters mediate 
a variety of cellular functions including regulation of membrane potentials and absorption 
and secretion of molecules and ion across cell membranes. When present in intracellular 
membranes of the Golgi apparatus and endocytic vesicles, transporters, such as chloride 
channels, also regulate organelle pH. For a review, see Greger, R. (1988) Annu. Rev. 
Physiol 50:111-122. 

Transporters are generally classified by structure and the type of mode of action. 
In addition, transporters are sometimes classified by the molecule type that is transported, 
for example, sugar transporters, chlorine channels, potassium channels, etc. There may 
be many classes of channels for transporting a single type of molecule (a detailed review 
of channel types can be found at Alexander, S.P.H. and J. A. Peters: Receptor and 
transporter nomenclature supplement. Trends Pharmacol. Sci., Elsevier, pp. 65-68 (1997). 
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The following general classification scheme is known in the art and is followed in 
the present discoveries. 

Channel-type transporters. Transmembrane channel proteins of this class are 
ubiquitously found in the membranes of all types of organisms from bacteria to higher 
eukaryotes. Transport systems of this type catalyze facilitated diffusion (by an energy- 
independent process) by passage through a transmembrane aqueous pore or channel 
without evidence for a carrier-mediated mechanism. These channel proteins usually 
consist largely of a-helical spanners, although b-strands may also be present and may 
even comprise the channel. However, outer membrane porin-type channel proteins are 
excluded from this class and are instead included in class 9. 

Carrier-type transporters. Transport systems are included in this class if they 
utilize a carrier-mediated process to catalyze uniport (a single species is transported by 
facilitated diffusion), antiport (two or more species are transported in opposite directions 
in a tightly coupled process, not coupled to a direct form of energy other than 
chemiosmotic energy) and/or symport (two or more species are transported together in 
the same direction in a tightly coupled process, not coupled to a direct form of energy 
other than chemiosmotic energy). 

Pyrophosphate bond hydrolysis-driven active transporters. Transport systems are 
included in this class if they hydrolyze pyrophosphate or the terminal pyrophosphate 
bond in ATP or another nucleoside triphosphate to drive the active uptake and/or 
extrusion of a solute or solutes. The transport protein may or may not be transiently 
phosphorylated, but the substrate is not phosphorylated, 

PEP-dependent, phosphoryl transfer-driven group translocators. Transport 
systems of the bacterial phosphoenolpyruvate:sugar phosphotransferase system are 
included in this class. The product of the reaction, derived from extracellular sugar, is a 
cytoplasmic sugar-phosphate. 

Decarboxylation-driven active transporters. Transport systems that drive solute 
(e.g., ion) uptake or extrusion by decarboxylation of a cytoplasmic substrate are included 
in this class. 
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Oxidoreduction-driven active transporters. Transport systems that drive transport 
of a solute (e.g., an ion) energized by the flow of electrons from a reduced substrate to an 
oxidized substrate are included in this class. 

Light-driven active transporters. Transport systems that utilize light energy to 
drive transport of a solute (e.g., an ion) are included in this class. 

Mechanically-driven active transporters. Transport systems are included in this 
class if they drive movement of a cell or organelle by allowing the flow of ions (or other 
solutes) through the membrane down their electrochemical gradients. 

Outer-membrane porins (of b-structure). These proteins form transmembrane 
pores or channels that usually allow the energy independent passage of solutes across a 
membrane. The transmembrane portions of these proteins consist exclusively of b-strands 
that form a b-barrel. These porin-type proteins are found in the outer membranes of 
Gram-negative bacteria, mitochondria and eukaryotic plastids. 

Methyltransferase-driven active transporters. A single characterized protein 
currently falls into this category, the Na+-transporting 
methyltetrahydromethanopterinxoenzyme M methyltransferase. 

Non-ribosome-synthesized channel-forming peptides or peptide-like molecules. 
These molecules, usually chains of L- and D-amino acids as well as other small 
molecular building blocks such as lactate, form oligomeric transmembrane ion channels. 
Voltage may induce channel formation by promoting assembly of the transmembrane 
channel. These peptides are often made by bacteria and fungi as agents of biological 
warfare. 

Non-Proteinaceous Transport Complexes. Ion conducting substances in biological 
membranes that do not consist of or are not derived from proteins or peptides fall into this 
category. 

Functionally characterized transporters for which sequence data are lacking. 
Transporters of particular physiological significance will be included in this category 
even though a family assignment cannot be made. 

Putative transporters in which no family member is an established transporter. 
Putative transport protein families are grouped under this number and will either be 
classified elsewhere when the transport function of a member becomes established, or 
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will be eliminated from the TC classification system if the proposed transport function is 
disproves These families include a member or members for which a transport function 
has been suggested, but evidence for such a function is not yet compelling. 

Auxiliary transport proteins. Proteins that in some way facilitate transport across 
one or more biological membranes but do not themselves participate directly in transport 
are included in this class. These proteins always function in conjunction with one or more 
transport proteins. They may provide a function connected with energy coupling to 
transport, play a structural role in complex formation or serve a regulatory function. 

Transporters of unknown classification. Transport protein families of unknown 
classification are grouped under this number and will be classified elsewhere when the 
transport process and energy coupling mechanism are characterized. These families 
include at least one member for which a transport function has been established, but 
either the mode of transport or the energy coupling mechanism is not known. 

Ion channels 

An important type of transporter is the ion channel. Ion channels regulate many 
different cell proliferation, differentiation, and signaling processes by regulating the flow 
of ions into and out of cells. Ion channels are found in the plasma membranes of virtually 
every cell in eukaryotic organisms. Ion channels mediate a variety of cellular functions 
including regulation of membrane potentials and absorption and secretion of ion across 
epithelial membranes. When present in intracellular membranes of the Golgi apparatus 
and endocytic vesicles, ion channels, such as chloride channels, also regulate organelle 
pH. For a review, see Greger, R. (1988) Annu. Rev. Physiol. 50:1 1 1-122. 

Ion channels are generally classified by structure and the type of mode of action. 
For example, extracellular ligand gated channels (ELGs) are comprised of five 
polypeptide subunits, with each subunit having 4 membrane spanning domains, and are 
activated by the binding of an extracellular ligand to the channel. In addition, channels 
are sometimes classified by the ion type that is transported, for example, chlorine 
channels, potassium channels, etc. There may be many classes of channels for 
transporting a single type of ion (a detailed review of channel types can be found at 
Alexander, S.P.H. and J. A. Peters (1997). Receptor and ion channel nomenclature 
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supplement. Trends Pharmacol. Sci. ? Elsevier, pp. 65-68 and http://www- 
biology.ucsd.edu/'-msaier/transport/toc.htmL 

There are many types of ion channels based on structure. For example, many ion 
channels fall within one of the following groups: extracellular ligand-gated channels 
(ELG), intracellular ligand-gated channels (ILG), inward rectifying channels (INR), 
intercellular (gap junction) channels, and voltage gated channels (VIC). There are 
additionally recognized other channel families based on ion-type transported, cellular 
location and drug sensitivity. Detailed information on each of these, their activity, ligand 
type, ion type, disease association, drugability, and other information pertinent to the 
present invention, is well known in the art. 

Extracellular ligand-gated channels, ELGs, are generally comprised of five 
polypeptide subunits, Unwin, N. (1993), Cell 72: 31-41; Unwin, N. (1995), Nature 373: 
37-43; Hucho, F. ? et al, (1996) J. Neurochem. 66: 1781-1792; Hucho, F., et al., (1996) 
Eur. J. Biochem. 239: 539-557; Alexander, S.P.H. and J.A. Peters (1997), Trends 
Pharmacol. ScL, Elsevier, pp. 4-6; 36-40; 42-44; and Xue, H. (1998) J. Mol. Evol. 47: 
323-333. Each subunit has 4 membrane spanning regions: this serves as a means of 
identifying other members of the ELG family of proteins. ELG bind a ligand and in 
response modulate the flow of ions. Examples of ELG include most members of the 
neurotransmitter-receptor family of proteins, e.g., GABAI receptors. Other members of 
this family of ion channels include glycine receptors, ryandyne receptors, and ligand 
gated calcium channels. 

The Voltage-gated Ion Channel (VIC) Superfamily 

Proteins of the VIC family are ion-selective channel proteins found in a wide 
range of bacteria, archaea and eukaryotes Hille, B. (1992), Chapter 9: Structure of 
channel proteins; Chapter 20: Evolution and diversity. In: Ionic Channels of Excitable 
Membranes, 2nd Ed., Sinaur Assoc. Inc., Pubs., Sunderland, Massachusetts; Sigworth, 
F.J. (1993), Quart. Rev. Biophys. 27: 1-40; Salkoff, L. and T. Jegla (1995), Neuron 15: 
489-492; Alexander, S.P.H. et al, (1997), Trends Pharmacol. ScL, Elsevier, pp. 76-84; 
Jan, L.Y. et al., (1997), Annu. Rev. Neurosci. 20: 91-123; Doyle, D.A, et al., (1998) 
Science 280: 69-77; Terlau, H. and W. Sttihmer (1998), Naturwissenschaften 85: 437- 
444. They are often homo- or heterooligomeric structures with several dissimilar 
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subunits (e.g., al-a2-d-b Ca 2+ channels, abib2 Na + channels or (a)4-b K + channels), but 
the channel and the primary receptor is usually associated with the a (or al) subunit. 
Functionally characterized members are specific for K + , Na + or Ca 2+ . The K + channels 
usually consist of homotetrameric structures with each a-subunit possessing six 
transmembrane spanners (TMSs). The al and a subunits of the Ca and Na channels, 
respectively, are about four times as large and possess 4 units, each with 6 TMSs 
separated by a hydrophilic loop, for a total of 24 TMSs. These large channel proteins 
form heterotetra-unit structures equivalent to the homotetrameric structures of most K + 
channels. All four units of the Ca 2+ and Na + channels are homologous to the single unit in 
the homotetrameric K + channels. Ion flux via the eukaryotic channels is generally 
controlled by the transmembrane electrical potential (hence the designation, voltage- 
sensitive) although some are controlled by ligand or receptor binding. 

Several putative K + -selective channel proteins of the VIC family have been 
identified in prokaryotes. The structure of one of them, the KcsA K + channel of 
Streptomyces lividans, has been solved to 3.2 A resolution. The protein possesses four 
identical subunits, each with two transmembrane helices, arranged in the shape of an 
inverted teepee or cone. The cone cradles the "selectivity filter" P domain in its outer end. 
The narrow selectivity filter is only 12 A long, whereas the remainder of the channel is 
wider and lined with hydrophobic residues. A large water-filled cavity and helix dipoles 
stabilize K + in the pore. The selectivity filter has two bound K + ions about 7.5 A apart 
from each other. Ion conduction is proposed to result from a balance of electrostatic 
attractive and repulsive forces. 

In eukaryotes, each VIC family channel type has several subtypes based on 
pharmacological and electrophysiological data. Thus, there are five types of Ca 2+ 
channels (L, N, P, Q and T). There are at least ten types of K + channels, each responding 
in different ways to different stimuli: voltage-sensitive [Ka, Kv, Kvr, Kvs and Ksr], Ca 2+ - 
sensitive [BKc a? IKc a and SKcJ and receptor-coupled [Km and K A ch]- There are at least 
six types of Na + channels (I, II, III, jxl, HI and PN3). Tetrameric channels from both 
prokaryotic and eukaryotic organisms are known in which each a-subunit possesses 2 
TMSs rather than 6, and these two TMSs are homologous to TMSs 5 and 6 of the six 
TMS unit found in the voltage-sensitive channel proteins. KcsA of S. lividans is an 
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example of such a 2 TMS channel protein. These channels may include the Kn 3 (Na + - 
activated) and Ky 0 i (cell volume-sensitive) K + channels, as well as distantly related 
channels such as the Tokl K + channel of yeast, the TWIK-1 inward rectifier K + channel 
of the mouse and the TREK-1 K + channel of the mouse. Because of insufficient sequence 
similarity with proteins of the VIC family, inward rectifier K + IRK channels (ATP- 
regulated; G-protein-activated) which possess a P domain and two flanking TMSs are 
placed in a distinct family. However, substantial sequence similarity in the P region 
suggests that they are homologous. The b, g and d subunits of VIC family members, 
when present, frequently play regulatory roles in channel activation/deactivation. 

The Epithelial Na + Channel (ENaC) Family 

The ENaC family consists of over twenty-four sequenced proteins (Canessa, 
CM., et al, (1994), Nature 367; 463-467, Le, T. and M.H. Saier, Jr. (1996), Mol. 
Membr, Biol. 13: 149-157; Garty, H. and L.G. Palmer (1997), Physiol. Rev. 77: 359-396; 
Waldmann, R., et al., (1997), Nature 386: 173-177; Darboux, L, et al., (1998), J. Biol. 
Chem. 273: 9424-9429; Firsov, D., et al., (1998), EMBO J. 17: 344-352; Horisberger, L- 
D. (1998). Curr. Opin. Struc. Biol. 10: 443-449). All are from animals with no 
recognizable homologues in other eukaryotes or bacteria. The vertebrate ENaC proteins 
from epithelial cells cluster tightly together on the phylogenetic tree: voltage-insensitive 
ENaC homologues are also found in the brain. Eleven sequenced C. elegans proteins, 
including the degenerins, are distantly related to the vertebrate proteins as well as to each 
other. At least some of these proteins form part of a mechano-transducing complex for 
touch sensitivity. The homologous Helix aspersa (FMRF-amide)-activated Na + channel is 
the first peptide neurotransmitter-gated ionotropic receptor to be sequenced. 

Protein members of this family all exhibit the same apparent topology, each with 
N- and C-termini on the inside of the cell, two amphipathic transmembrane spanning 
segments, and a large extracellular loop. The extracellular domains contain numerous 
highly conserved cysteine residues. They are proposed to serve a receptor function. 

Mammalian ENaC is important for the maintenance of Na + balance and the 
regulation of blood pressure. Three homologous ENaC subunits, alpha, beta, and gamma, 
have been shown to assemble to form the highly Na + -selective channel. The 
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stoichiometry of the three subunits is alphas betal, gammal in a heterotetrameric 
architecture. 

The Glutamate- gated Ion Channel (GIO Family of Neurotransmitter Receptors 

Members of the GIC family are heteropentameric complexes in which each of the 
5 subunits is of 800-1000 amino acyl residues in length (Nakanishi, N., et al, (1990), 
Neuron 5: 569-581; Unwin, N. (1993), Cell 72: 31-41; Alexander, S.P.H. and J.A. Peters 
(1997) Trends Pharmacol. Sci., Elsevier, pp. 36-40). These subunits may span the 
membrane three or five times as putative a-helices with the N-termini (the glutamate- 
binding domains) localized extracellularly and the C-termini localized cytoplasmically. 
They may be distantly related to the ligand-gated ion channels, and if so, they may 
possess substantial b-structure in their transmembrane regions. However, homology 
between these two families cannot be established on the basis of sequence comparisons 
alone. The subunits fall into six subfamilies: a, b, g, d, e and z. 

The GIC channels are divided into three types: (1) a-amino-3-hydroxy-5-methyl- 
4-isoxazole propionate (AMP A)-, (2) kainate- and (3) N-methyl-D-aspartate (NMDA)- 
selective glutamate receptors. Subunits of the AMP A and kainate classes exhibit 35-40% 
identity with each other while subunits of the NMDA receptors exhibit 22-24% identity 
with the former subunits. They possess large N-terminal, extracellular glutamate-binding 
domains that are homologous to the periplasmic glutamine and glutamate receptors of 
ABC-type uptake permeases of Gram-negative bacteria. All known members of the GIC 
family are from animals. The different channel (receptor) types exhibit distinct ion 
selectivities and conductance properties. The NMDA-selective large conductance 
channels are highly permeable to monovalent cations and Ca 2+ . The AMP A- and kainate- 
selective ion channels are permeable primarily to monovalent cations with only low 
permeability to Ca 2+ . 

The Chloride Channel (C1C) Family 

The C1C family is a large family consisting of dozens of sequenced proteins 
derived from Gram-negative and Gram-positive bacteria, cyanobacteria, archaea, yeast, 
plants and animals (Steinmeyer, K., et al., (1991), Nature 354: 301-304; Uchida, S., et al, 
(1993), J. Biol Chem. 268: 3821-3824; Huang, M.-E., et al, (1994), J. Mol. Biol. 242: 
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595-598; Kawasaki, M, et al, (1994), Neuron 12: 597-604; Fisher, W.R, et al., (1995), 
Genomics. 29:598-606; and Foskett, J.K. (1998), Annu. Rev. Physiol 60: 689-717). 
These proteins are essentially ubiquitous, although they are not encoded within genomes 
of Haemophilus influenzae, Mycoplasma genitaliurn, and Mycoplasma pneumoniae. 
Sequenced proteins vary in size from 395 amino acyl residues (M. jannaschii) to 988 
residues (man). Several organisms contain multiple C1C family paralogues. For example, 
Synechocystis has two paralogues, one of 451 residues in length and the other of 899 
residues. Arabidopsis thaliana has at least four sequenced paralogues, (775-792 
residues), humans also have at least five paralogues (820-988 residues), and C elegans 
also has at least five (810-950 residues). There are nine known members in mammals, 
and mutations in three of the corresponding genes cause human diseases. E. coli, 
Methanococcus jannaschii and Saccharomyces cerevisiae only have one C1C family 
member each. With the exception of the larger Synechocystis paralogue, all bacterial 
proteins are small (395-492 residues) while all eukaryotic proteins are larger (687-988 
residues). These proteins exhibit 10-12 putative transmembrane a-helical spanners 
(TMSs) and appear to be present in the membrane as homodimers. While one member of 
the family, Torpedo CIC-O, has been reported to have two channels, one per subunit, 
others are believed to have just one. 

All functionally characterized members of the C1C family transport chloride, 
some in a voltage-regulated process. These channels serve a variety of physiological 
functions (cell volume regulation; membrane potential stabilization; signal transduction; 
transepithelial transport, etc.). Different homologues in humans exhibit differing anion 
selectivities, i.e., C1C4 and C1C5 share a NO3" > CI" > Br" > I" conductance sequence, 
while C1C3 has an F > CI" selectivity. The C1C4 and C1C5 channels and others exhibit 
outward rectifying currents with currents only at voltages more positive than +20mV. 

Animal Inward Rectifier K + Channel (IRK-C) Family 

IRK channels possess the "minimal channel-forming structure" with only a P 
domain, characteristic of the channel proteins of the VIC family, and two flanking 
transmembrane spanners (Shuck, M.E., et al., (1994), J. Biol. Chem. 269: 24261-24270; 
Ashen, M.D., et al, (1995), Am. J. Physiol. 268: H506-H51 1; Salkoff, L. and T. Jegla 
(1995), Neuron 15: 489-492; Aguilar-Bryan, U et al, (1998), Physiol. Rev. 78: 227-245; 
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Ruknudin, A., et al, (1998), J. Biol. Chem. 273: 14165-14171). They may exist in the 
membrane as homo- or heterooligomers. They have a greater tendency to let K + flow into 
the cell than out. Voltage-dependence may be regulated by external K , by internal Mg , 
by internal ATP and/or by G-proteins. The P domains of IRK channels exhibit limited 
sequence similarity to those of the VIC family, but this sequence similarity is insufficient 
to establish homology. Inward rectifiers play a role in setting cellular membrane 
potentials, and the closing of these channels upon depolarization permits the occurrence 
of long duration action potentials with a plateau phase. Inward rectifiers lack the intrinsic 
voltage sensing helices found in VIC family channels. In a few cases, those of KirLla 
and Kir6.2, for example, direct interaction with a member of the ABC superfamily has 
been proposed to confer unique functional and regulatory properties to the heteromeric 
complex, including sensitivity to ATP. The SUR1 sulfonylurea receptor (spQ09428) is 
the ABC protein that regulates the Kir6.2 channel in response to ATP, and CFTR may 
regulate Kirl ,1a. Mutations in SUR1 are the cause of familial persistent hyperinsulinemic 
hypoglycemia in infancy (PHHI), an autosomal recessive disorder characterized by 
unregulated insulin secretion in the pancreas. 

ATP-gated Cation Channel (ACQ Family 

Members of the ACC family (also called P2X receptors) respond to ATP, a 
functional neurotransmitter released by exocytosis from many types of neurons (North, 
R.A. (1996), Curr. Opin. Cell Biol 8: 474-483; Soto, R, M. Garcia-Guzman and W. 
Stiihmer (1997), J. Membr. Biol. 160: 91-100). They have been placed into seven groups 
(P2Xi - P2X 7 ) based on their pharmacological properties. These channels, which function 
at neuron-neuron and neuron-smooth muscle junctions, may play roles in the control of 
blood pressure and pain sensation. They may also function in lymphocyte and platelet 
physiology. They are found only in animals. 

The proteins of the ACC family are quite similar in sequence (>35% identity), but 
they possess 380-1000 amino acyl residues per subunit with variability in length localized 
primarily to the C-terminal domains. They possess two transmembrane spanners, one 
about 30-50 residues from their N-termini, the other near residues 320-340. The 
extracellular receptor domains between these two spanners (of about 270 residues) are 
well conserved with numerous conserved glycyl and cysteyl residues. The hydrophilic C- 
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termini vary in length from 25 to 240 residues. They resemble the topologically similar 
epithelial Na + channel (ENaC) proteins in possessing (a) N- and C-termini localized 
intracellular^, (b) two putative transmembrane spanners, (c) a large extracellular loop 
domain, and (d) many conserved extracellular cysteyl residues. ACC family members 
are, however, not demonstrably homologous with them. ACC channels are probably 
hetero- or homomultimers and transport small monovalent cations (Me + ). Some also 
transport Ca 2+ ; a few also transport small metabolites. 

The Ryanodine-Inositol 1,4,5 -triphosphate Receptor Ca 2+ Channel (RIR-CaC) 

Family 

Ryanodine (Ry)-sensitive and inositol 1,4,5 -triphosphate (IP3)-sensitive Ca 2+ - 
release channels function in the release of Ca 24 " from intracellular storage sites in animal 
cells and thereby regulate various Ca 2+ -dependent physiological processes (Hasan, G. et 
al, (1992) Development 116: 967-975; Michikawa, T., et aL, (1994), J. Biol. Chem. 269: 
9184-9189; Tunwell, R.E.A., (1996), Biochem. J. 318: 477-487; Lee, A.G. (1996) 
Biomembranes, Vol. 6, Transmembrane Receptors and Channels (A.G. Lee, ed.), JAI 
Press, Denver, CO., pp 291-326; Mikoshiba, K., et al, (1996) J. Biochem. Biomem. 6: 
273-289). Ry receptors occur primarily in muscle cell sarcoplasmic reticular (SR) 
membranes, and IP3 receptors occur primarily in brain cell endoplasmic reticular (ER) 
membranes where they effect release of Ca 2+ into the cytoplasm upon activation 
(opening) of the channel. 

The Ry receptors are activated as a result of the activity of dihydropyridine- 
sensitive Ca 2+ channels. The latter are members of the voltage-sensitive ion channel 
(VIC) family. Dihydropyridine-sensitive channels are present in the T-tubular systems of 
muscle tissues. 

Ry receptors are homotetrameric complexes with each subunit exhibiting a 
molecular size of over 500,000 daltons (about 5,000 amino acyl residues). They possess 
C-terminal domains with six putative transmembrane a -helical spanners (TMSs). 
Putative pore-forming sequences occur between the fifth and sixth TMSs as suggested for 
members of the VIC family. The large N-terminal hydrophilic domains and the small C- 
terminal hydrophilic domains are localized to the cytoplasm. Low resolution 3- 
dimensional structural data are available. Mammals possess at least three isoforms that 
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probably arose by gene duplication and divergence before divergence of the mammalian 
species. Homologues are present in humans and Caenorabditis elegans. 

IP3 receptors resemble Ry receptors in many respects. (1) They are 
homotetrameric complexes with each subunit exhibiting a molecular size of over 300,000 
daltons (about 2,700 amino acyl residues). (2) They possess C-terminal channel domains 
that are homologous to those of the Ry receptors. (3) The channel domains possess six 
putative TMSs and a putative channel lining region between TMSs 5 and 6. (4) Both the 
large N-terminal domains and the smaller C-terminal tails face the cytoplasm. (5) They 
possess covalently linked carbohydrate on extracytoplasmic loops of the channel 
domains. (6) They have three currently recognized isoforms (types 1, 2, and 3) in 
mammals which are subject to differential regulation and have different tissue 
distributions. 

IP3 receptors possess three domains: N-terminal DVbinding domains, central 
coupling or regulatory domains and C-terminal channel domains. Channels are activated 
by IP3 binding, and like the Ry receptors, the activities of the IP3 receptor channels are 
regulated by phosphorylation of the regulatory domains, catalyzed by various protein 
kinases. They predominate in the endoplasmic reticular membranes of various cell types 
in the brain but have also been found in the plasma membranes of some nerve cells 
derived from a variety of tissues. 

The channel domains of the Ry and IP3 receptors comprise a coherent family that 
in spite of apparent structural similarities, do not show appreciable sequence similarity of 
the proteins of the VIC family. The Ry receptors and the IP3 receptors cluster separately 
on the RIR-CaC family tree. They both have homologues in Drosophila, Based on the 
phylogenetic tree for the family, the family probably evolved in the following sequence: 
(1) A gene duplication event occurred that gave rise to Ry and IP3 receptors in 
invertebrates. (2) Vertebrates evolved from invertebrates. (3) The three isoforms of each 
receptor arose as a result of two distinct gene duplication events. (4) These isoforms were 
transmitted to mammals before divergence of the mammalian species. 

The Organellar Chloride Channel (Q-C1C) Family 

Proteins of the O-CIC family are voltage-sensitive chloride channels found in 
intracellular membranes but not the plasma membranes of animal cells (Landry, D, et al, 
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(1993), J. Biol Chem. 268: 14948-14955; Valenzuela, Set aL, (1997), J. Biol. Chem. 272: 
12575-12582; and Duncan, R.R., et aL, (1997), J. Biol. Chem. 272: 23880-23886). 

They are found in human nuclear membranes, and the bovine protein targets to 
the microsomes, but not the plasma membrane, when expressed in Xenopus laevis 
oocytes. These proteins are thought to function in the regulation of the membrane 
potential and in transepithelial ion absorption and secretion in the kidney. They possess 
two putative transmembrane a-helical spanners (TMSs) with cytoplasmic N- and ex- 
ternum and a large luminal loop that may be glycosylated. The bovine protein is 437 
amino acyl residues in length and has the two putative TMSs at positions 223-239 and 
367-385. The human nuclear protein is much smaller (241 residues). A C. elegans 
homologue is 260 residues long. 

Glutamate receptor 

Glutamate receptors are the predominant excitatory neurotransmitter receptors in 
the mammalian brain and are activated in a variety of normal neurophysiology processes. 
The classification of glutamate receptors is based on their activation by different 
pharmacologic agonists. Thus, glutamate receptors have been named according to their 
respective agonists, the N-methyl-D-aspartate, quisqualic acid (QUIS), kainate (KA), and 
2-amino-4-phosphonobutyrate (AP4) receptors. 

The present invention has substantial similarity to a human glutamate receptor 
(GLUH1). Puckett et aL (1991) isolated and sequenced a human glutamate receptor 
cDNA. The sequence of GLUH1 was predicted to encode a 907 -amino acid protein that 
had 97% identity to one of the rodent kainate receptor subunits. GLUR1 mRNA is widely 
expressed in human brain. The human gene encoding the GluHI subunit is located at 
5q33. While the GluHI gene is not located near a chromosomal region associated with 
any human neurogenetic disorders, the homologous region on mouse chromosome 1 1 
contains the sites of five neurologic mutations. 

Zamanillo et aL (1999) generated mice lacking the AMPA receptor subunit 
GluRA, also known as GluRl , by homologous recombination. Homozygous knockout 
mice exhibited normal development, life expectancy, and fine structure of neuronal 
dendrites and synapses. They were smaller than their littermates during the first postnatal 
weeks, but after weaning their size was normal. In hippocampal CA1 pyramidal neurons, 
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GluRA -/- mice showed a reduction in functional AMPA receptors, with the remaining 
receptors preferentially targeted to synapses. Thus, the CA1 soma-patch currents were 
strongly reduced but glutamatergic synaptic currents were unaltered; evoked dendritic 
and spinous calcium currents, calcium-dependent gene activation, and hippocampal field 
potentials were as in wildtype. In adult GluRA -/- mice, associative long-term 
potentiation was absent in CA3 to CA1 synapses, but spatial learning in the water maze 
was not impaired. The results suggested to Zamanillo et al. (1999) that CA1 hippocampal 
long-term potentiation is controlled by the number or subunit composition of AMPA 
receptors and show a dichotomy between long-term potentiation in CA1 and acquisition 
of spatial memory. For a review related to the present invention, see Puckett et al., Proc. 
Natl. Acad. Sci. U.S.A. 88 (17), 7557-7561 (1991); Gregor et al., Proc. Nat Acad. Sci. 
90: 3053-3057, 1993; Lee et al, Nature 405: 955-959, 2000; Shi et al., Science 284: 
1811-1816, 1999; Sunetal., Proc. Nat. Acad. Sci. 89: 1443-1447, 1992; Warrington et 
al., Genomics 13: 803-808, 1992; Zamanillo et al, Science 284: 1805-1811, 1999. 

Transporter proteins, particularly members of the glutamate receptor subfamily, are 
a major target for drug action and development. Accordingly, it is valuable to the field of 
pharmaceutical development to identify and characterize previously unknown transport 
proteins. The present invention advances the state of the art by providing previously 
unidentified human transport proteins. 

SUMMARY OF THE INVENTION 

The present invention is based in part on the identification of amino acid 
sequences of human transporter peptides and proteins that are related to the glutamate 
receptor subfamily, as well as allelic variants and other mammalian orthologs thereof. 
These unique peptide sequences, and nucleic acid sequences that encode these peptides, 
can be used as models for the development of human therapeutic targets, aid in the 
identification of therapeutic proteins, and serve as targets for the development of human 
therapeutic agents that modulate transporter activity in cells and tissues that express the 
transporter. Experimental data as provided in Figure 1 indicates expression in humans in 
the placenta, nervous system, brain and hippocampus. 

14 



CL001205 



DESCRIPTION OF THE FIGURE SHEETS 

FIGURE 1 provides the nucleotide sequence of a cDNA molecule sequence that 
encodes the transporter protein of the present invention. (SEQIDNO: 1) In addition 
structure and functional information is provided, such as ATG start, stop and tissue 
distribution, where available, that allows one to readily determine specific uses of 
inventions based on this molecular sequence. Experimental data as provided in Figure 1 
indicates expression in humans in the placenta, nervous system, brain and hippocampus. 

FIGURE 2 provides the predicted amino acid sequence of the transporter of the 
present invention. (SEQ ID NO: 2) In addition structure and functional information such 
as protein family, function, and modification sites is provided where available, allowing 
one to readily determine specific uses of inventions based on this molecular sequence. 

FIGURE 3 provides genomic sequences that span the gene encoding the 
transporter protein of the present invention. (SEQ ID NO: 3) In addition structure and 
functional information, such as intron/exon structure, promoter location, etc., is provided 
where available, allowing one to readily determine specific uses of inventions based on 
this molecular sequence. 331 SNPs, including 27 indels, have been identified in the gene 
encoding the receptor protein provided by the present invention and are given in Figure 3. 

DETAILED DESCRIPTION OF THE INVENTION 

General Description 

The present invention is based on the sequencing of the human genome. During 
the sequencing and assembly of the human genome, analysis of the sequence information 
revealed previously unidentified fragments of the human genome that encode peptides 
that share structural and/or sequence homology to protein/peptide/domains identified and 
characterized within the art as being a transporter protein or part of a transporter protein 
and are related to the glutamate receptor subfamily. Utilizing these sequences, additional 
genomic sequences were assembled and transcript and/or cDNA sequences were isolated 
and characterized. Based on this analysis, the present invention provides amino acid 
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sequences of human transporter peptides and proteins that are related to the glutamate 
receptor subfamily, nucleic acid sequences in the form of transcript sequences, cDNA 
sequences and/or genomic sequences that encode these transporter peptides and proteins, 
nucleic acid variation (allelic information), tissue distribution of expression, and 
information about the closest art known protein/peptide/domain that has structural or 
sequence homology to the transporter of the present invention. 

In addition to being previously unknown, the peptides that are provided in the 
present invention are selected based on their ability to be used for the development of 
commercially important products and services. Specifically, the present peptides are 
selected based on homology and/or structural relatedness to known transporter proteins of 
the glutamate receptor subfamily and the expression pattern observed. Experimental data 
as provided in Figure 1 indicates expression in humans in the placenta, nervous system, 
brain and hippocampus.. The art has clearly established the commercial importance of 
members of this family of proteins and proteins that have expression patterns similar to 
that of the present gene. Some of the more specific features of the peptides of the present 
invention, and the uses thereof, are described herein, particularly in the Background of 
the Invention and in the annotation provided in the Figures, and/or are known within the 
art for each of the known glutamate receptor family or subfamily of transporter proteins. 

Specific Embodiments 
Peptide Molecules 

The present invention provides nucleic acid sequences that encode protein 
molecules that have been identified as being members of the transporter family of 
proteins and are related to the glutamate receptor subfamily (protein sequences are 
provided in Figure 2, transcript/cDNA sequences are provided in Figures 1 and genomic 
sequences are provided in Figure 3). The peptide sequences provided in Figure 2, as well 
as the obvious variants described herein, particularly allelic variants as identified herein 
and using the information in Figure 3, will be referred herein as the transporter peptides 
of the present invention, transporter peptides, or peptides/proteins of the present 
invention. 
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The present invention provides isolated peptide and protein molecules that consist 
of, consist essentially of, or comprising the amino acid sequences of the transporter 
peptides disclosed in the Figure 2, (encoded by the nucleic acid molecule shown in 
Figure 1, transcript/cDNA or Figure 3, genomic sequence), as well as all obvious variants 
of these peptides that are within the art to make and use. Some of these variants are 
described in detail below. 

As used herein, a peptide is said to be "isolated" or "purified" when it is 
substantially free of cellular material or free of chemical precursors or other chemicals. 
The peptides of the present invention can be purified to homogeneity or other degrees of 
purity. The level of purification will be based on the intended use. The critical feature is 
that the preparation allows for the desired function of the peptide, even if in the presence of 
considerable amounts of other components (the features of an isolated nucleic acid molecule 
is discussed below). 

In some uses, "substantially free of cellular material" includes preparations of the 
peptide having less than about 30% (by dry weight) other proteins (i.e., contaminating 
protein), less than about 20% other proteins, less than about 1 0% other proteins, or less than 
about 5% other proteins. When the peptide is recombinantly produced, it can also be 
substantially free of culture medium, i.e., culture medium represents less than about 20% of 
the volume of the protein preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the peptide in which it is separated from chemical precursors or other 
chemicals that are involved in its synthesis. In one embodiment, the language "substantially 
free of chemical precursors or other chemicals" includes preparations of the transporter 
peptide having less than about 30% (by dry weight) chemical precursors or other chemicals, 
less than about 20% chemical precursors or other chemicals, less than about 10% chemical 
precursors or other chemicals, or less than about 5% chemical precursors or other chemicals. 

The isolated transporter peptide can be purified from cells that naturally express it, 
purified from cells that have been altered to express it (recombinant), or synthesized using 
known protein synthesis methods. Experimental data as provided in Figure 1 indicates 
expression in humans in the placenta, nervous system, brain and hippocampus. For 
example, a nucleic acid molecule encoding the transporter peptide is cloned into an 
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expression vector, the expression vector introduced into a host cell and the protein expressed 
in the host cell. The protein can then be isolated from the cells by an appropriate 
purification scheme using standard protein purification techniques. Many of these 
techniques are described in detail below. 

Accordingly, the present invention provides proteins that consist of the amino acid 
sequences provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the 
transcript/cDNA nucleic acid sequences shown in Figure 1 (SEQ ID NO:l) and the genomic 
sequences provided in Figure 3 (SEQ ID NO:3). The amino acid sequence of such a protein 
is provided in Figure 2. A protein consists of an amino acid sequence when the amino acid 
sequence is the final amino acid sequence of the protein. 

The present invention further provides proteins that consist essentially of the amino 
acid sequences provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the 
transcript/cDNA nucleic acid sequences shown in Figure 1 (SEQ ID NO:l) and the genomic 
sequences provided in Figure 3 (SEQ ID NO:3). A protein consists essentially of an amino 
acid sequence when such an amino acid sequence is present with only a few additional 
amino acid residues, for example from about 1 to about 100 or so additional residues, 
typically from 1 to about 20 additional residues in the final protein. 

The present invention further provides proteins that comprise the amino acid 
sequences provided in Figure 2 (SEQ ID NO:2), for example, proteins encoded by the 
transcript/cDNA nucleic acid sequences shown in Figure 1 (SEQ ID NO:l) and the genomic 
sequences provided in Figure 3 (SEQ ID NO:3). A protein comprises an amino acid 
sequence when the amino acid sequence is at least part of the final amino acid sequence of 
the protein. In such a fashion, the protein can be only the peptide or have additional amino 
acid molecules, such as amino acid residues (contiguous encoded sequence) that are 
naturally associated with it or heterologous amino acid residues/peptide sequences. Such a 
protein can have a few additional amino acid residues or can comprise several hundred or 
more additional amino acids. The preferred classes of proteins that are comprised of the 
transporter peptides of the present invention are the naturally occurring mature proteins. A 
brief description of how various types of these proteins can be made/isolated is provided 
below. 
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The transporter peptides of the present invention can be attached to heterologous 
sequences to form chimeric or fusion proteins. Such chimeric and fusion proteins comprise 
a transporter peptide operatively linked to a heterologous protein having an amino acid 
sequence not substantially homologous to the transporter peptide. "Operatively linked" 
indicates that the transporter peptide and the heterologous protein are fused in-frame. The 
heterologous protein can be fused to the N-terminus or C-terminus of the transporter 
peptide. 

In some uses, the fusion protein does not affect the activity of the transporter peptide 
per se. For example, the fusion protein can include, but is not limited to, enzymatic fusion 
proteins, for example beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His 
fusions, MYC-tagged, Hi-tagged and Ig fusions. Such fusion proteins, particularly poly-His 
fusions, can facilitate the purification of recombinant transporter peptide. In certain host 
cells (e.g., mammalian host cells), expression and/or secretion of a protein can be increased 
by using a heterologous signal sequence. 

A chimeric or fusion protein can be produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the different protein sequences are 
ligated together in-frame in accordance with conventional techniques. In another 
embodiment, the fusion gene can be synthesized by conventional techniques including 
automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be 
carried out using anchor primers which give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be annealed and re-amplified to 
generate a chimeric gene sequence (see Ausubel et ah, Current Protocols in Molecular 
Biology, 1992). Moreover, many expression vectors are commercially available that already 
encode a fusion moiety (e.g., a GST protein). A transporter peptide-encoding nucleic acid 
can be cloned into such an expression vector such that the fusion moiety is linked in-frame 
to the transporter peptide. 

As mentioned above, the present invention also provides and enables obvious 
variants of the amino acid sequence of the proteins of the present invention, such as 
naturally occurring mature forms of the peptide, allelic/sequence variants of the peptides, 
non-naturally occurring recombinant^ derived variants of the peptides, and orthologs and 
paralogs of the peptides. Such variants can readily be generated using art-known techniques 
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in the fields of recombinant nucleic acid technology and protein biochemistry. It is 
understood, however, that variants exclude any amino acid sequences disclosed prior to the 
invention. 

Such variants can readily be identified/made using molecular techniques and the 
sequence information disclosed herein. Further, such variants can readily be distinguished 
from other peptides based on sequence and/or structural homology to the transporter 
peptides of the present invention. The degree of homology/identity present will be based 
primarily on whether the peptide is a functional variant or non-functional variant, the 
amount of divergence present in the paralog family and the evolutionary distance between 
the orthologs. 

To determine the percent identity of two amino acid sequences or two nucleic 
acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps 
can be introduced in one or both of a first and a second amino acid or nucleic acid 
sequence for optimal alignment and non-homologous sequences can be disregarded for 
comparison purposes). In a preferred embodiment, at least 30%, 40%, 50%, 60%, 70%, 
80%, or 90% or more of a reference sequence is aligned for comparison purposes. The 
amino acid residues or nucleotides at corresponding amino acid positions or nucleotide 
positions are then compared. When a position in the first sequence is occupied by the 
same amino acid residue or nucleotide as the corresponding position in the second 
sequence, then the molecules are identical at that position (as used herein amino acid or 
nucleic acid "identity" is equivalent to amino acid or nucleic acid "homology"). The 
percent identity between the two sequences is a function of the number of identical 
positions shared by the sequences, taking into account the number of gaps, and the length 
of each gap, which need to be introduced for optimal alignment of the two sequences. 

The comparison of sequences and determination of percent identity and similarity 
between two sequences can be accomplished using a mathematical algorithm. 
{Computational Molecular Biology \ Lesk, A.M., ed., Oxford University Press, New York, 
1988; Biocomputing: Informatics and Genome Projects, Smith, D.W., ed., Academic Press, 
New York, 1993; Computer Analysis of Sequence Data, Part i, Griffin, A.M., and Griffin, 
H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von 
Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and 
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Devereux, J., eds., M Stockton Press, New York, 1991). In a preferred embodiment, the 
percent identity between two amino acid sequences is determined using the Needleman 
and Wunsch (J. Mol Biol. (48):444-453 (1970)) algorithm which has been incorporated 
into the GAP program in the GCG software package (available at http://www.gcg.com), 
using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 
10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred 
embodiment, the percent identity between two nucleotide sequences is determined using 
the GAP program in the GCG software package (Devereux, J., et al, Nucleic Acids Res. 
12(1):3S7 (1984)) (available at http://www.gcg.com), using a NWSgapcina.CMP matrix 
and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1 , 2, 3, 4, 5, or 6. In 
another embodiment, the percent identity between two amino acid or nucleotide 
sequences is determined using the algorithm of E. Myers and W. Miller (CABIOS, 4:1 1- 
1 7 (1 989)) which has been incorporated into the ALIGN program (version 2.0), using a 
PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be 
used as a "query sequence" to perform a search against sequence databases to, for 
example, identify other family members or related sequences. Such searches can be 
performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al (J. 
Mol Biol 215:403-10 (1990)). BLAST nucleotide searches can be performed with the 
NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to the nucleic acid molecules of the invention. BLAST protein searches can 
be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino 
acid sequences homologous to the proteins of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al {Nucleic Acids Res. 25(17):3389-3402 (1997)). When utilizing BLAST 
and gapped BLAST programs, the default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. 

Full-length pre-processed forms, as well as mature processed forms, of proteins that 
comprise one of the peptides of the present invention can readily be identified as having 
complete sequence identity to one of the transporter peptides of the present invention as well 
as being encoded by the same genetic locus as the transporter peptide provided herein. As 
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indicated by the data presented in Figure 3, the map position was determined to be on 
chromosome 5 by ePCR. 

Allelic variants of a transporter peptide can readily be identified as being a human 
protein having a high degree (significant) of sequence homology/identity to at least a portion 
of the transporter peptide as well as being encoded by the same genetic locus as the 
transporter peptide provided herein. Genetic locus can readily be determined based on the 
genomic information provided in Figure 3, such as the genomic sequence mapped to the 
reference human. As indicated by the data presented in Figure 3, the map position was 
determined to be on chromosome 5 by ePCR. As used herein, two proteins (or a region of 
the proteins) have significant homology when the amino acid sequences are typically at 
least about 70-80%, 80-90%, and more typically at least about 90-95% or more 
homologous. A significantly homologous amino acid sequence, according to the present 
invention, will be encoded by a nucleic acid sequence that will hybridize to a transporter 
peptide encoding nucleic acid molecule under stringent conditions as more fully 
described below. 

Figure 3 provides information on SNPs that have been identified in a gene 
encoding the transporter protein of the present invention. 331 SNP variants were found, 
including 27 indels (indicated by a "-") and 1 SNP in exons. SNPs, identified at different 
nucleotide positions in introns and regions 5' and 3' of the ORF, may affect 
control/regulatory elements. 

Paralogs of a transporter peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the transporter peptide, as 
being encoded by a gene from humans, and as having similar activity or function. Two 
proteins will typically be considered paralogs when the amino acid sequences are 
typically at least about 60% or greater, and more typically at least about 70% or greater 
homology through a given region or domain. Such paralogs will be encoded by a nucleic 
acid sequence that will hybridize to a transporter peptide encoding nucleic acid molecule 
under moderate to stringent conditions as more fully described below. 

Orthologs of a transporter peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the transporter peptide as well 
as being encoded by a gene from another organism. Preferred orthologs will be isolated 
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from mammals, preferably primates, for the development of human therapeutic targets and 
agents. Such orthologs will be encoded by a nucleic acid sequence that will hybridize to a 
transporter peptide encoding nucleic acid molecule under moderate to stringent 
conditions, as more fully described below, depending on the degree of relatedness of the 
two organisms yielding the proteins. 

Non-naturally occurring variants of the transporter peptides of the present invention 
can readily be generated using recombinant techniques. Such variants include, but are not 
limited to deletions, additions and substitutions in the amino acid sequence of the transporter 
peptide. For example, one class of substitutions are conserved amino acid substitution. 
Such substitutions are those that substitute a given amino acid in a transporter peptide by 
another amino acid of like characteristics. Typically seen as conservative substitutions are 
the replacements, one for another, among the aliphatic amino acids Ala, Val, Leu, and He; 
interchange of the hydroxyl residues Ser and Thr; exchange of the acidic residues Asp and 
Glu; substitution between the amide residues Asn and Gin; exchange of the basic residues 
Lys and Arg; and replacements among the aromatic residues Phe and Tyr. Guidance 
concerning which amino acid changes are likely to be phenotypically silent are found in 
Bowie etal, Science 247:1306-1310 (1990). 

Variant transporter peptides can be fully functional or can lack function in one or 
more activities, e.g. ability to bind ligand, ability to transport ligand, ability to mediate 
signaling, etc. Fully functional variants typically contain only conservative variation or 
variation in non-critical residues or in non-critical regions. Figure 2 provides the result of 
protein analysis and can be used to identify critical domains/regions. Functional variants 
can also contain substitution of similar amino acids that result in no change or an 
insignificant change in function. Alternatively, such substitutions may positively or 
negatively affect function to some degree. 

Non-functional variants typically contain one or more non-conservative amino acid 
substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, 
inversion, or deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods known in 
the art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et 
aL, Science 244:1081-1085 (1989)), particularly using the results provided in Figure 2. The 
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latter procedure introduces single alanine mutations at eveiy residue in the molecule. The 
resulting mutant molecules are then tested for biological activity such as transporter activity 
or in assays such as an in vitro proliferative activity. Sites that are critical for binding 
partner/substrate binding can also be determined by structural analysis such as 
crystallization, nuclear magnetic resonance or photoaffinity labeling (Smith et al, J. Mol 
Biol 224:899-904 (1992); de Vos et al Science 255:306-312 (1992)). 

The present invention further provides fragments of the transporter peptides, in 
addition to proteins and peptides that comprise and consist of such fragments, particularly 
those comprising the residues identified in Figure 2. The fragments to which the invention 
pertains, however, are not to be construed as encompassing fragments that may be disclosed 
publicly prior to the present invention. 

As used herein, a fragment comprises at least 8, 10, 12, 14, 16, or more contiguous 
amino acid residues from a transporter peptide. Such fragments can be chosen based on the 
ability to retain one or more of the biological activities of the transporter peptide or could be 
chosen for the ability to perform a function, e.g, bind a substrate or act as an immunogen* 
Particularly important fragments are biologically active fragments, peptides that are, for 
example, about 8 or more amino acids in length. Such fragments will typically comprise a 
domain or motif of the transporter peptide, e.g., active site, a transmembrane domain or a 
substrate-binding domain. Further, possible fragments include, but are not limited to, 
domain or motif containing fragments, soluble peptide fragments, and fragments containing 
immunogenic structures. Predicted domains and functional sites are readily identifiable by 
computer programs well known and readily available to those of skill in the art (e.g., 
PROSITE analysis). The results of one such analysis are provided in Figure 2. 

Polypeptides often contain amino acids other than the 20 amino acids commonly 
referred to as the 20 naturally occurring amino acids. Further, many amino acids, including 
the terminal amino acids, may be modified by natural processes, such as processing and 
other post-translational modifications, or by chemical modification techniques well known 
in the art. Common modifications that occur naturally in transporter peptides are described 
in basic texts, detailed monographs, and the research literature, and they are well known to 
those of skill in the art (some of these features are identified in Figure 2). 



24 



CL001205 



Known modifications include, but are not limited to, acetylation, acylation, ADP- 
ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of 
a lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent crosslinks, 
formation of cystine, formation of pyroglutamate, formylation, gamma carboxylation, 
glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, 
racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids to 
proteins such as arginylation, and ubiquitination. 

Such modifications are well known to those of skill in the art and have been 
described in great detail in the scientific literature. Several particularly common 
modifications, glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic 
acid residues, hydroxylation and ADP-ribosylation, for instance, are described in most basic 
texts, such as Proteins - Structure and Molecular Properties, 2nd Ed., T.E. Creighton, W. H. 
Freeman and Company, New York (1993). Many detailed reviews are available on this 
subject, such as by Wold, F., Posttranslatxonal Covalent Modification of Proteins, B.C. 
Johnson, Ed., Academic Press, New York 1-12 (1983); Seifter et al (Metk Enzymol 182: 
626-646 (1990)) and Rattan et al (Ann. K Y. Acad Sci. 663:48-62 (1992)). 

Accordingly, the transporter peptides of the present invention also encompass 
derivatives or analogs in which a substituted amino acid residue is not one encoded by the 
genetic code, in which a substituent group is included, in which the mature transporter 
peptide is fused with another compound, such as a compound to increase the half-life of the 
transporter peptide (for example, polyethylene glycol), or in which the additional amino 
acids are fused to the mature transporter peptide, such as a leader or secretory sequence or a 
sequence for purification of the mature transporter peptide or a pro-protein sequence. 

Protein/Peptide Uses 

The proteins of the present invention can be used in substantial and specific 
assays related to the functional information provided in the Figures; to raise antibodies or 
to elicit another immune response; as a reagent (including the labeled reagent) in assays 
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designed to quantitatively determine levels of the protein (or its binding partner or ligand) 
in biological fluids; and as markers for tissues in which the corresponding protein is 
preferentially expressed (either constitutively or at a particular stage of tissue 
differentiation or development or in a disease state). Where the protein binds or 
potentially binds to another protein or ligand (such as, for example, in a transporter- 
effector protein interaction or transporter-ligand interaction), the protein can be used to 
identify the binding partner/ligand so as to develop a system to identify inhibitors of the 
binding interaction. Any or all of these uses are capable of being developed into reagent 
grade or kit format for commercialization as commercial products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include "Molecular Cloning: A Laboratory 
Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. 
Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

The potential uses of the peptides of the present invention are based primarily on 
the source of the protein as well as the class/action of the protein. For example, 
transporters isolated from humans and their human/mammalian orthologs serve as targets 
for identifying agents for use in mammalian therapeutic applications, e.g. a human drug, 
particularly in modulating a biological or pathological response in a cell or tissue that 
expresses the transporter. Experimental data as provided in Figure 1 indicates that the 
transporter proteins of the present invention are expressed in humans in the placenta, 
nervous system and brain detected by a virtual northern blot. In addition, PCR-based 
tissue screening panels indicate expression in hippocampus. A large percentage of 
pharmaceutical agents are being developed that modulate the activity of transporter 
proteins, particularly members of the glutamate receptor subfamily (see Background of 
the Invention). The structural and functional information provided in the Background 
and Figures provide specific and substantial uses for the molecules of the present 
invention, particularly in combination with the expression information provided in Figure 
1. Experimental data as provided in Figure 1 indicates expression in humans in the placenta, 
nervous system, brain and hippocampus. Such uses can readily be determined using the 
information provided herein, that known in the art and routine experimentation. 
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The proteins of the present invention (including variants and fragments that may 
have been disclosed prior to the present invention) are useful for biological assays related to 
transporters that are related to members of the glutamate receptor subfamily. Such assays 
involve any of the known transporter functions or activities or properties useful for diagnosis 
and treatment of transporter-related conditions that are specific for the subfamily of 
transporters that the one of the present invention belongs to, particularly in cells and tissues 
that express the transporter. Experimental data as provided in Figure 1 indicates that the 
transporter proteins of the present invention are expressed in humans in the placenta, 
nervous system and brain detected by a virtual northern blot. In addition, PCR-based 
tissue screening panels indicate expression in hippocampus. The proteins of the present 
invention are also useful in drug screening assays, in cell-based or cell-free systems 
((Hodgson, Bio/technology, 1992, Sept 10(9);973-80). Cell-based systems can be native, 
i.e., cells that normally express the transporter, as a biopsy or expanded in cell culture. 
Experimental data as provided in Figure 1 indicates expression in humans in the placenta, 
nervous system, brain and hippocampus. In an alternate embodiment, cell-based assays 
involve recombinant host cells expressing the transporter protein. 

The polypeptides can be used to identify compounds that modulate transporter 
activity of the protein in its natural state or an altered form that causes a specific disease or 
pathology associated with the transporter. Both the transporters of the present invention and 
appropriate variants and fragments can be used in high-throughput screens to assay 
candidate compounds for the ability to bind to the transporter. These compounds can be 
further screened against a functional transporter to determine the effect of the compound on 
the transporter activity. Further, these compounds can be tested in animal or invertebrate 
systems to determine activity/effectiveness. Compounds can be identified that activate 
(agonist) or inactivate (antagonist) the transporter to a desired degree. 

Further, the proteins of the present invention can be used to screen a compound for 
the ability to stimulate or inhibit interaction between the transporter protein and a molecule 
that normally interacts with the transporter protein, e.g. a substrate or a component of the 
signal pathway that the transporter protein normally interacts (for example, another 
transporter). Such assays typically include the steps of combining the transporter protein 
with a candidate compound under conditions that allow the transporter protein, or fragment, 
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to interact with the target molecule, and to detect the formation of a complex between the 
protein and the target or to detect the biochemical consequence of the interaction with the 
transporter protein and the target, such as any of the associated effects of signal transduction 
such as changes in membrane potential, protein phosphorylation, cAMP turnover, and 
adenylate cyclase activation, etc. 

Candidate compounds include, for example, 1) peptides such as soluble peptides, 
including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g., Lam 
etal, Nature 354:82-84 (1991); Houghtener al, Nature 554:84-86 (1991)) and 
combinatorial chemistry-derived molecular libraries made of D- and/or L- configuration 
amino acids; 2) phosphopeptides (e.g., members of random and partially degenerate, 
directed phosphopeptide libraries, see, e.g., Songyang et al 9 Cell 72:767-778 (1993)); 3) 
antibodies (e.g., polyclonal, monoclonal, humanized, anti-idiotypic, chimeric, and single 
chain antibodies as well as Fab, F(ab% Fab expression library fragments, and epitope- 
binding fragments of antibodies); and 4) small organic and inorganic molecules (e.g., 
molecules obtained from combinatorial and natural product libraries). 

One candidate compound is a soluble fragment of the receptor that competes for 
ligand binding. Other candidate compounds include mutant transporters or appropriate 
fragments containing mutations that affect transporter function and thus compete for ligand. 
Accordingly, a fragment that competes for ligand, for example with a higher affinity, or a 
fragment that binds ligand but does not allow release, is encompassed by the invention. 

The invention further includes other end point assays to identify compounds that 
modulate (stimulate or inhibit) transporter activity. The assays typically involve an assay of 
events in the signal transduction pathway that indicate transporter activity. Thus, the 
transport of a ligand, change in cell membrane potential, activation of a protein, a change in 
the expression of genes that are up- or down-regulated in response to the transporter protein 
dependent signal cascade can be assayed. 

Any of the biological or biochemical functions mediated by the transporter can be 
used as an endpoint assay. These include all of the biochemical or biochemical/biological 
events described herein, in the references cited herein, incorporated by reference for these 
endpoint assay targets, and other functions known to those of ordinary skill in the art or that 
can be readily identified using the information provided in the Figures, particularly Figure 2. 
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Specifically, a biological function of a cell or tissues that expresses the transporter can be 
assayed. Experimental data as provided in Figure 1 indicates that the transporter proteins 
of the present invention are expressed in humans in the placenta, nervous system and 
brain detected by a virtual northern blot. In addition, PCR-based tissue screening panels 
indicate expression in hippocampus. 

Binding and/or activating compounds can also be screened by using chimeric 
transporter proteins in which the amino terminal extracellular domain, or parts thereof, the 
entire transmembrane domain or subregions, such as any of the seven transmembrane 
segments or any of the intracellular or extracellular loops and the carboxy terminal 
intracellular domain, or parts thereof, can be replaced by heterologous domains or 
subregions. For example, a ligand-binding region can be used that interacts with a different 
ligand then that which is recognized by the native transporter. Accordingly, a different set 
of signal transduction components is available as an end-point assay for activation. This 
allows for assays to be performed in other than the specific host cell from which the 
transporter is derived. 

The proteins of the present invention are also useful in competition binding assays in 
methods designed to discover compounds that interact with the transporter (e.g. binding 
partners and/or ligands). Thus, a compound is exposed to a transporter polypeptide under 
conditions that allow the compound to bind or to otherwise interact with the polypeptide. 
Soluble transporter polypeptide is also added to the mixture. If the test compound interacts 
with the soluble transporter polypeptide, it decreases the amount of complex formed or 
activity from the transporter target. This type of assay is particularly useful in cases in 
which compounds are sought that interact with specific regions of the transporter. Thus, the 
soluble polypeptide that competes with the target transporter region is designed to contain 
peptide sequences corresponding to the region of interest- 
To perform cell free drug screening assays, it is sometimes desirable to immobilize 
either the transporter protein, or fragment, or its target molecule to facilitate separation of 
complexes from uncomplexed forms of one or both of the proteins, as well as to 
accommodate automation of the assay. 

Techniques for immobilizing proteins on matrices can be used in the drug screening 
assays. In one embodiment, a fusion protein can be provided which adds a domain that 
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allows the protein to be bound to a matrix. For example, glutathione-S-transferase fusion 
proteins can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, 
MO) or glutathione derivatized microtitre plates, which are then combined with the cell 
lysates (e.g., 35 S-labeled) and the candidate compound, and the mixture incubated under 
conditions conducive to complex formation (e.g., at physiological conditions for salt and 
pH). Following incubation, the beads are washed to remove any unbound label, and the 
matrix immobilized and radiolabel determined directly, or in the supernatant after the 
complexes are dissociated. Alternatively, the complexes can be dissociated from the matrix, 
separated by SDS-PAGE, and the level of transporter-binding protein found in the bead 
fraction quantitated from the gel using standard electrophoretic techniques. For example, 
either the polypeptide or its target molecule can be immobilized utilizing conjugation of 
biotin and streptavidin using techniques well known in the art. Alternatively, antibodies 
reactive with the protein but which do not interfere with binding of the protein to its target 
molecule can be derivatized to the wells of the plate, and the protein trapped in the wells by 
antibody conjugation. Preparations of a transporter-binding protein and a candidate 
compound are incubated in the transporter protein-presenting wells and the amount of 
complex trapped in the well can be quantitated. Methods for detecting such complexes, in 
addition to those described above for the GST-immobilized complexes, include 
immunodetection of complexes using antibodies reactive with the transporter protein target 
molecule, or which are reactive with transporter protein and compete with the target 
molecule, as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the target molecule. 

Agents that modulate one of the transporters of the present invention can be 
identified using one or more of the above assays, alone or in combination. It is generally 
preferable to use a cell-based or cell free system first and then confirm activity in an animal 
or other model system. Such model systems are well known in the art and can readily be 
employed in mis context. 

Modulators of transporter protein activity identified according to these drug 
screening assays can be used to treat a subject with a disorder mediated by the transporter 
pathway, by treating cells or tissues that express the transporter. Experimental data as 
provided in Figure 1 indicates expression in humans in the placenta, nervous system, brain 
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and hippocampus. These methods of treatment include the steps of administering a 
modulator of transporter activity in a pharmaceutical composition to a subject in need of 
such treatment, the modulator being identified as described herein. 

In yet another aspect of the invention, the transporter proteins can be used as "bait 
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos et al. (1993) Cell 72:223-232; Madura et al. (1993) J. Biol. Chem. 
268:12046-12054; Barrel etal. (1993) Biotechniques 14:920-924; Iwabuchi et al. (1993) 
Oncogene 8:1693-1696; and Brent WO94/10300), to identify other proteins, which bind 
to or interact with the transporter and are involved in transporter activity. Such 
transporter-binding proteins are also likely to be involved in the propagation of signals by 
the transporter proteins or transporter targets as, for example, downstream elements of a 
transporter-mediated signaling pathway. Alternatively, such transporter-binding proteins 
are likely to be transporter inhibitors. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that codes for a 
transporter protein is fused to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a library 
of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is fused to a 
gene that codes for the activation domain of the known transcription factor. If the "bait" 
and the "prey" proteins are able to interact, in vivo, forming a transporter-dependent 
complex, the DNA-binding and activation domains of the transcription factor are brought 
into close proximity. This proximity allows transcription of a reporter gene (e.g., LacZ) 
which is operably linked to a transcriptional regulatory site responsive to the transcription 
factor. Expression of the reporter gene can be detected and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the cloned gene which 
encodes the protein which interacts with the transporter protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., a transporter-modulating agent, an antisense 
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transporter nucleic acid molecule, a transporter-specific antibody, or a transporter- 
binding partner) can be used in an animal or other model to determine the efficacy, 
toxicity, or side effects of treatment with such an agent. Alternatively, an agent identified 
as described herein can be used in an animal or other model to determine the mechanism 
of action of such an agent. Furthermore, this invention pertains to uses of novel agents 
identified by the above-described screening assays for treatments as described herein. 

The transporter proteins of the present invention are also useful to provide a target 
for diagnosing a disease or predisposition to disease mediated by the peptide. Accordingly, 
the invention provides methods for detecting the presence, or levels of, the protein (or 
encoding mRNA) in a cell, tissue, or organism. Experimental data as provided in Figure 1 
indicates expression in humans in the placenta, nervous system, brain and hippocampus. 
The method involves contacting a biological sample with a compound capable of interacting 
with the transporter protein such that die interaction can be detected. Such an assay can be 
provided in a single detection format or a multi-detection format such as an antibody chip 
array. 

One agent for detecting a protein in a sample is an antibody capable of selectively 
binding to protein. A biological sample includes tissues, cells and biological fluids isolated 
from a subject, as well as tissues, cells and fluids present within a subject. 

The peptides of the present invention also provide targets for diagnosing active 
protein activity, disease, or predisposition to disease, in a patient having a variant peptide, 
particularly activities and conditions that are known for other members of the family of 
proteins to which the present one belongs. Thus, the peptide can be isolated from a 
biological sample and assayed for the presence of a genetic mutation that results in aberrant 
peptide. This includes amino acid substitution, deletion, insertion, rearrangement, (as the 
result of aberrant splicing events), and inappropriate post-translational modification. 
Analytic methods include altered electrophoretic mobility, altered tryptic peptide digest, 
altered transporter activity in cell-based or cell-free assay, alteration in ligand or antibody- 
binding pattern, altered isoelectric point, direct amino acid sequencing, and any other of the 
known assay techniques useful for detecting mutations in a protein. Such an assay can be 
provided in a single detection format or a multi-detection format such as an antibody chip 
array. 
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In vitro techniques for detection of peptide include enzyme linked immunosorbent 
assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence using a 
detection reagent, such as an antibody or protein binding agent. Alternatively, the peptide 
can be detected in vivo in a subject by introducing into the subject a labeled anti-peptide 
antibody or other types of detection agent. For example, the antibody can be labeled with a 
radioactive marker whose presence and location in a subject can be detected by standard 
imaging techniques. Particularly useful are methods that detect the allelic variant of a 
peptide expressed in a subject and methods which detect fragments of a peptide in a sample. 

The peptides are also useful in pharmacogenomic analysis. Pharmacogenomics deal 
with clinically significant hereditary variations in the response to drugs due to altered drug 
disposition and abnormal action in affected persons. See, e.g., Eichelbaum, M. {Clin. Exp, 
Pharmacol Physiol 23(10-1 1):983-985 (1996)), and Linder, M.W. {Clin. Chem. 43(2):254- 
266 (1997)). The clinical outcomes of these variations result in severe toxicity of 
therapeutic drugs in certain individuals or therapeutic failure of drugs in certain individuals 
as a result of individual variation in metabolism. Thus, the genotype of the individual can 
determine the way a therapeutic compound acts on the body or the way the body 
metabolizes the compound. Further, the activity of drug metabolizing enzymes effects both 
the intensity and duration of drug action. Thus, the pharmacogenomics of the individual 
permit the selection of effective compounds and effective dosages of such compounds for 
prophylactic or therapeutic treatment based on the individual's genotype. The discovery of 
genetic polymorphisms in some drug metabolizing enzymes has explained why some 
patients do not obtain the expected drug effects, show an exaggerated drug effect, or 
experience serious toxicity from standard drug dosages. Polymorphisms can be expressed 
in the phenotype of the extensive metabolizer and the phenotype of the poor metabolizer. 
Accordingly, genetic polymorphism may lead to allelic protein variants of the transporter 
protein in which one or more of the transporter functions in one population is different from 
those in another population. The peptides thus allow a target to ascertain a genetic 
predisposition that can affect treatment modality. Thus, in a ligand-based treatment, 
polymorphism may give rise to amino terminal extracellular domains and/or other ligand- 
binding regions that are more or less active in ligand binding, and transporter activation. 
Accordingly, ligand dosage would necessarily be modified to maximize the therapeutic 
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effect within a given population containing a polymorphism. As an alternative to 
genotyping, specific polymorphic peptides could be identified. 

The peptides are also useful for treating a disorder characterized by an absence of, 
inappropriate, or unwanted expression of the protein. Experimental data as provided in 
Figure 1 indicates expression in humans in the placenta, nervous system, brain and 
hippocampus. Accordingly, methods for treatment include the use of the transporter protein 
or fragments. 

Antibodies 

The invention also provides antibodies that selectively bind to one of the peptides of 
the present invention, a protein comprising such a peptide, as well as variants and fragments 
thereof. As used herein, an antibody selectively binds a target peptide when it binds the 
target peptide and does not significantly bind to unrelated proteins. An antibody is still 
considered to selectively bind a peptide even if it also binds to other proteins that are not 
substantially homologous with the target peptide so long as such proteins share homology 
with a fragment or domain of the peptide target of the antibody. In this case, it would be 
understood that antibody binding to the peptide is still selective despite some degree of 
cross-reactivity. 

As used herein, an antibody is defined in terms consistent with that recognized 
within the art: they are multi-subunit proteins produced by a mammalian organism in 
response to an antigen challenge. The antibodies of the present invention include polyclonal 
antibodies and monoclonal antibodies, as well as fragments of such antibodies, including, 
but not limited to, Fab or F(ab')2 ? and Fv fragments. 

Many methods are known for generating and/or identifying antibodies to a given 
target peptide. Several such methods are described by Harlow, Antibodies, Cold Spring 
Harbor Press, (1989). 

In general, to generate antibodies, an isolated peptide is used as an immunogen and 
is administered to a mammalian organism, such as a rat, rabbit or mouse. The full-length 
protein, an antigenic peptide fragment or a fusion protein can be used. Particularly 
important fragments are those covering functional domains, such as the domains identified 
in Figure 2, and domain of sequence homology or divergence amongst the family, such as 
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those that can readily be identified using protein alignment methods and as presented in the 
Figures. 

Antibodies are preferably prepared from regions or discrete fragments of the 
transporter proteins. Antibodies can be prepared from any region of the peptide as 
described herein. However, preferred regions will include those involved in 
function/activity and/or transporter/binding partner interaction. Figure 2 can be used to 
identify particularly important regions while sequence alignment can be used to identify 
conserved and unique sequence fragments. 

An antigenic fragment will typically comprise at least 8 contiguous amino acid 
residues. The antigenic peptide can comprise, however, at least 10, 12, 14, 16 or more 
amino acid residues. Such fragments can be selected on a physical property, such as 
fragments correspond to regions that are located on the surface of the protein, e.g., 
hydrophilic regions or can be selected based on sequence uniqueness (see Figure 2). 

Detection on an antibody of the present invention can be facilitated by coupling (i.e., 
physically linking) the antibody to a detectable substance. Examples of detectable 
substances include various enzymes, prosthetic groups, fluorescent materials, luminescent 
materials, bioluminescent materials, and radioactive materials. Examples of suitable 
enzymes include horseradish peroxidase, alkaline phosphatase, P-galactosidase, or 
acetylcholinesterase; examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials include 
umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dicWorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes 
luminol; examples of bioluminescent materials include luciferase, luciferin, and aequorin, 
and examples of suitable radioactive material include 125 1, 131 1, 35 S or 3 H. 

Antibody Uses 

The antibodies can be used to isolate one of the proteins of the present invention by 
standard techniques, such as affinity chromatography or immunoprecipitation. The 
antibodies can facilitate the purification of the natural protein from cells and recombinantly 
produced protein expressed in host cells. In addition, such antibodies are useful to detect the 
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presence of one of the proteins of the present invention in cells or tissues to determine the 
pattern of expression of the protein among various tissues in an organism and over the 
course of normal development. Experimental data as provided in Figure 1 indicates that 
the transporter proteins of the present invention are expressed in humans in the placenta, 
nervous system and brain detected by a virtual northern blot. In addition, PCR-based 
tissue screening panels indicate expression in hippocampus. Further, such antibodies can 
be used to detect protein in situ, in vitro, or in a cell lysate or supernatant in order to evaluate 
the abundance and pattern of expression. Also, such antibodies can be used to assess 
abnormal tissue distribution or abnormal expression during development or progression of a 
biological condition. Antibody detection of circulating fragments of the full length protein 
can be used to identify turnover. 

Further, the antibodies can be used to assess expression in disease states such as in 
active stages of the disease or in an individual with a predisposition toward disease related to 
the protein's function. When a disorder is caused by an inappropriate tissue distribution, 
developmental expression, level of expression of the protein, or expressed/processed form, 
the antibody can be prepared against the normal protein. Experimental data as provided in 
Figure 1 indicates expression in humans in the placenta, nervous system, brain and 
hippocampus. If a disorder is characterized by a specific mutation in the protein, antibodies 
specific for this mutant protein can be used to assay for the presence of the specific mutant 
protein. 

The antibodies can also be used to assess normal and aberrant subcellular 
localization of cells in the various tissues in an organism. Experimental data as provided in 
Figure 1 indicates expression in humans in the placenta, nervous system, brain and 
hippocampus. The diagnostic uses can be applied, not only in genetic testing, but also in 
monitoring a treatment modality. Accordingly, where treatment is ultimately aimed at 
correcting expression level or the presence of aberrant sequence and aberrant tissue 
distribution or developmental expression, antibodies directed against the protein or relevant 
fragments can be used to monitor therapeutic efficacy. 

Additionally, antibodies are useful in pharmacogenomic analysis. Thus, antibodies 
prepared against polymorphic proteins can be used to identify individuals that require 
modified treatment modalities. The antibodies are also useful as diagnostic tools as an 
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immunological marker for aberrant protein analyzed by electrophoretic mobility, isoelectric 
point, tryptic peptide digest, and other physical assays known to those in the art. 

The antibodies are also useful for tissue typing. Experimental data as provided in 
Figure 1 indicates expression in humans in the placenta, nervous system, brain and 
hippocampus. Thus, where a specific protein has been correlated with expression in a 
specific tissue, antibodies that are specific for this protein can be used to identify a tissue 
type. 

The antibodies are also useful for inhibiting protein function, for example, blocking 
the binding of the transporter peptide to a binding partner such as a ligand or protein binding 
partner. These uses can also be applied in a therapeutic context in which treatment involves 
inhibiting the protein's function. An antibody can be used, for example, to block binding, 
thus modulating (agonizing or antagonizing) the peptides activity. Antibodies can be 
prepared against specific fragments containing sites required for function or against intact 
protein that is associated with a cell or cell membrane. See Figure 2 for structural 
information relating to the proteins of the present invention. 

The invention also encompasses kits for using antibodies to detect the presence of a 
protein in a biological sample. The kit can comprise antibodies such as a labeled or 
labelable antibody and a compound or agent for detecting protein in a biological sample; 
means for determining the amount of protein in the sample; means for comparing the 
amount of protein in the sample with a standard; and instructions for use. Such a kit can be 
supplied to detect a single protein or epitope or can be configured to detect one of a 
multitude of epitopes, such as in an antibody detection array. Arrays are described in detail 
below for nucleic acid arrays and similar methods have been developed for antibody arrays. 

Nucleic Acid Molecules 

The present invention further provides isolated nucleic acid molecules that encode a 
transporter peptide or protein of the present invention (cDNA, transcript and genomic 
sequence). Such nucleic acid molecules will consist of, consist essentially of, or comprise a 
nucleotide sequence that encodes one of the transporter peptides of the present invention, an 
allelic variant thereof, or an ortholog or paralog thereof. 
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As used herein, an "isolated" nucleic acid molecule is one that is separated from 
other nucleic acid present in the natural source of the nucleic acid. Preferably, an "isolated" 
nucleic acid is free of sequences that naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic DNA of the organism from which 
the nucleic acid is derived. However, there can be some flanking nucleotide sequences, for 
example up to about 5KB, 4KB, 3KB, 2KB, or 1KB or less, particularly contiguous peptide 
encoding sequences and peptide encoding sequences within the same gene but separated by 
introns in the genomic sequence. The important point is that the nucleic acid is isolated 
from remote and unimportant flanking sequences such that it can be subjected to the specific 
manipulations described herein such as recombinant expression, preparation of probes and 
primers, and other uses specific to the nucleic acid sequences. 

Moreover, an "isolated" nucleic acid molecule, such as a transcript/cDNA molecule, 
can be substantially free of other cellular material, or culture medium when produced by 
recombinant techniques, or chemical precursors or other chemicals when chemically 
synthesized. However, the nucleic acid molecule can be fused to other coding or regulatory 
sequences and still be considered isolated. 

For example, recombinant DNA molecules contained in a vector are considered 
isolated. Further examples of isolated DNA molecules include recombinant DNA 
molecules maintained in heterologous host cells or purified (partially or substantially) DNA 
molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA transcripts 
of the isolated DNA molecules of the present invention. Isolated nucleic acid molecules 
according to the present invention further include such molecules produced synthetically. 

Accordingly, the present invention provides nucleic acid molecules that consist of 
the nucleotide sequence shown in Figure 1 or 3 (SEQ ID NO :1, transcript sequence and 
SEQ ID NO:3, genomic sequence), or any nucleic acid molecule that encodes the protein 
provided in Figure 2, SEQ ID NO:2. A nucleic acid molecule consists of a nucleotide 
sequence when the nucleotide sequence is the complete nucleotide sequence of the nucleic 
acid molecule. 

The present invention further provides nucleic acid molecules that consist essentially 
of the nucleotide sequence shown in Figure 1 or 3 (SEQ ID NO:l, transcript sequence and 
SEQ ID NO:3, genomic sequence), or any nucleic acid molecule that encodes the protein 



38 



CL001205 



provided in Figure 2, SEQ ID NO:2. A nucleic acid molecule consists essentially of a 
nucleotide sequence when such a nucleotide sequence is present with only a few additional 
nucleic acid residues in the final nucleic acid molecule. 

The present invention further provides nucleic acid molecules that comprise the 
nucleotide sequences shown in Figure 1 or 3 (SEQ ID NO:l, transcript sequence and SEQ 
ID NO:3, genomic sequence), or any nucleic acid molecule that encodes the protein 
provided in Figure 2, SEQ ID NO:2. A nucleic acid molecule comprises a nucleotide 
sequence when the nucleotide sequence is at least part of the final nucleotide sequence of 
the nucleic acid molecule. In such a fashion, the nucleic acid molecule can be only the 
nucleotide sequence or have additional nucleic acid residues, such as nucleic acid residues 
that are naturally associated with it or heterologous nucleotide sequences. Such a nucleic 
acid molecule can have a few additional nucleotides or can comprise several hundred or 
more additional nucleotides. A brief description of how various types of these nucleic acid 
molecules can be readily made/isolated is provided below. 

In Figures 1 and 3, both coding and non-coding sequences are provided. Because 
of the source of the present invention, humans genomic sequence (Figure 3) and 
cDN A/transcript sequences (Figure 1), the nucleic acid molecules in the Figures will 
contain genomic intronic sequences, 5' and 3' non-coding sequences, gene regulatory 
regions and non-coding intergenic sequences. In general such sequence features are 
either noted in Figures 1 and 3 or can readily be identified using computational tools 
known in the art. As discussed below, some of the non-coding regions, particularly gene 
regulatory elements such as promoters, are useful for a variety of purposes, e.g. control of 
heterologous gene expression, target for identifying gene activity modulating compounds, 
and are particularly claimed as fragments of the genomic sequence provided herein. 

The isolated nucleic acid molecules can encode the mature protein plus additional 
amino or carboxyl-terminal amino acids, or amino acids interior to the mature peptide (when 
the mature form has more than one peptide chain, for instance). Such sequences may play a 
role in processing of a protein from precursor to a mature form, facilitate protein trafficking, 
prolong or shorten protein half-life or facilitate manipulation of a protein for assay or 
production, among other things. As generally is the case in situ, the additional amino acids 
may be processed away from the mature protein by cellular enzymes. 
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As mentioned above, the isolated nucleic acid molecules include, but are not limited 
to, the sequence encoding the transporter peptide alone, the sequence encoding the mature 
peptide and additional coding sequences, such as a leader or secretory sequence (e.g., a pre- 
pro or pro-protein sequence), the sequence encoding the mature peptide, with or without the 
additional coding sequences, plus additional non-coding sequences, for example introns and 
non-coding 5' and 3' sequences such as transcribed but non-translated sequences that play a 
role in transcription, mRNA processing (including splicing and polyadenylation signals), 
ribosome binding and stability of mRNA. In addition, the nucleic acid molecule may be 
fused to a marker sequence encoding, for example, a peptide that facilitates purification. 

Isolated nucleic acid molecules can be in the form of RNA, such as mRNA, or in the 
form DNA, including cDNA and genomic DNA obtained by cloning or produced by 
chemical synthetic techniques or by a combination thereof. The nucleic acid, especially 
DNA, can be double-stranded or single-stranded. Single-stranded nucleic acid can be the 
coding strand (sense strand) or the non-coding strand (anti-sense strand). 

The invention further provides nucleic acid molecules that encode fragments of the 
peptides of the present invention as well as nucleic acid molecules that encode obvious 
variants of the transporter proteins of the present invention that are described above. Such 
nucleic acid molecules may be naturally occurring, such as allelic variants (same locus), 
paralogs (different locus), and orthologs (different organism), or may be constructed by 
recombinant DNA methods or by chemical synthesis. Such non-naturally occurring variants 
may be made by mutagenesis techniques, including those applied to nucleic acid molecules, 
cells, or organisms. Accordingly, as discussed above, the variants can contain nucleotide 
substitutions, deletions, inversions and insertions. Variation can occur in either or both the 
coding and non-coding regions. The variations can produce both conservative and non- 
conservative amino acid substitutions. 

The present invention further provides non-coding fragments of the nucleic acid 
molecules provided in Figures 1 and 3. Preferred non-coding fragments include, but are not 
limited to, promoter sequences, enhancer sequences, gene modulating sequences and gene 
termination sequences. Such fragments are useful in controlling heterologous gene 
expression and in developing screens to identify gene-modulating agents. A promoter can 
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readily be identified as being 5' to the ATG start site in the genomic sequence provided in 
Figure 3. 

A fragment comprises a contiguous nucleotide sequence greater than 12 or more 
nucleotides. Further, a fragment could at least 30, 40, 50, 1 00, 250 or 500 nucleotides in 
length. The length of the fragment will be based on its intended use. For example, the 
fragment can encode epitope bearing regions of the peptide, or can be useful as DNA probes 
and primers. Such fragments can be isolated using the known nucleotide sequence to 
synthesize an oligonucleotide probe. A labeled probe can then be used to screen a cDNA 
library, genomic DNA library, or mRNA to isolate nucleic acid corresponding to the coding 
region. Further, primers can be used in PCR reactions to clone specific regions of gene. 

A probe/primer typically comprises substantially a purified oligonucleotide or 
oligonucleotide pair. The oligonucleotide typically comprises a region of nucleotide 
sequence that hybridizes under stringent conditions to at least about 12, 20, 25, 40, 50 or 
more consecutive nucleotides. 

Orthologs, homologs, and allelic variants can be identified using methods well 
known in the art. As described in the Peptide Section, these variants comprise a nucleotide 
sequence encoding a peptide that is typically 60-70%, 70-80%, 80-90%, and more typically 
at least about 90-95% or more homologous to the nucleotide sequence shown in the Figure 
sheets or a fragment of this sequence. Such nucleic acid molecules can readily be identified 
as being able to hybridize under moderate to stringent conditions, to the nucleotide sequence 
shown in the Figure sheets or a fragment of the sequence. Allelic variants can readily be 
determined by genetic locus of the encoding gene. As indicated by the data presented in 
Figure 3, the map position was determined to be on chromosome 5 by ePCR. 

Figure 3 provides information on SNPs that have been identified in a gene 
encoding the transporter protein of the present invention. 331 SNP variants were found, 
including 27 indels (indicated by a "-") and 1 SNP in exons. SNPs, identified at different 
nucleotide positions in introns and regions 5' and 3' of the ORF, may affect 
control/regulatory elements. 

As used herein, the term "hybridizes under stringent conditions" is intended to 
describe conditions for hybridization and washing under which nucleotide sequences 
encoding a peptide at least 60-70% homologous to each other typically remain hybridized to 
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each other. The conditions can be such that sequences at least about 60%, at least about 
70%, or at least about 80% or more homologous to each other typically remain hybridized to 
each other. Such stringent conditions are known to those skilled in the art and can be found 
in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-63.6. 
One example of stringent hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45C, followed by one or more washes in 0.2 X SSC, 
0.1% SDS at 50-65C. Examples of moderate to low stringency hybridization conditions are 
well known in the art. 

Nucleic Acid Molecule Uses 

The nucleic acid molecules of the present invention are useful for probes, primers, 
chemical intermediates, and in biological assays. The nucleic acid molecules are useful as a 
hybridization probe for messenger RNA, transcript/cDNA and genomic DNA to isolate full- 
length cDNA and genomic clones encoding the peptide described in Figure 2 and to isolate 
cDNA and genomic clones that correspond to variants (alleles, orthologs, etc.) producing the 
same or related peptides shown in Figure 2. 331 SNPs, including 27 indels, have been 
identified in the gene encoding the receptor protein provided by the present invention and 
are given in Figure 3. 

The probe can correspond to any sequence along the entire length of the nucleic acid 
molecules provided in the Figures. Accordingly, it could be derived from 5' noncoding 
regions, the coding region, and 3' noncoding regions. However, as discussed, fragments are 
not to be construed as encompassing fragments disclosed prior to the present invention. 

The nucleic acid molecules are also useful as primers for PCR to amplify any given 
region of a nucleic acid molecule and are useful to synthesize antisense molecules of desired 
length and sequence. 

The nucleic acid molecules are also useful for constructing recombinant vectors. 
Such vectors include expression vectors that express a portion of, or all of, the peptide 
sequences. Vectors also include insertion vectors, used to integrate into another nucleic acid 
molecule sequence, such as into the cellular genome, to alter in situ expression of a gene 
and/or gene product. For example, an endogenous coding sequence can be replaced via 
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homologous recombination with all or part of the coding region containing one or more 
specifically introduced mutations. 

The nucleic acid molecules are also useful for expressing antigenic portions of the 
proteins. 

The nucleic acid molecules are also useful as probes for determining the 
chromosomal positions of the nucleic acid molecules by means of in situ hybridization 
methods. As indicated by the data presented in Figure 3, the map position was determined 
to be on chromosome 5 by ePCR. 

The nucleic acid molecules are also useful in making vectors containing the gene 
regulatory regions of the nucleic acid molecules of the present invention. 

The nucleic acid molecules are also useful for designing ribozymes corresponding to 
all, or a part, of the mRNA produced from the nucleic acid molecules described herein. 

The nucleic acid molecules are also useful for making vectors that express part, or 
all, of the peptides. 

The nucleic acid molecules are also useful for constructing host cells expressing a 
part, or all, of the nucleic acid molecules and peptides. 

The nucleic acid molecules are also useful for constructing transgenic animals 
expressing all, or a part, of the nucleic acid molecules and peptides. 

The nucleic acid molecules are also useful as hybridization probes for determining 
the presence, level, form and distribution of nucleic acid expression. Experimental data as 
provided in Figure 1 indicates that the transporter proteins of the present invention are 
expressed in humans in the placenta, nervous system and brain detected by a virtual 
northern blot. In addition, PCR-based tissue screening panels indicate expression in 
hippocampus. 

Accordingly, the probes can be used to detect the presence of, or to determine levels 
of, a specific nucleic acid molecule in cells, tissues, and in organisms. The nucleic acid 
whose level is determined can be DNA or RNA. Accordingly, probes corresponding to the 
peptides described herein can be used to assess expression and/or gene copy number in a 
given cell, tissue, or organism. These uses are relevant for diagnosis of disorders involving 
an increase or decrease in transporter protein expression relative to normal results. 
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In vitro techniques for detection of mRNA include Northern hybridizations and in 
situ hybridizations. In vitro techniques for detecting DNA include Southern hybridizations 
and in situ hybridization. 

Probes can be used as a part of a diagnostic test kit for identifying cells or tissues 
that express a transporter protein, such as by measuring a level of a transporter-encoding 
nucleic acid in a sample of cells from a subject e.g., mRNA or genomic DNA, or 
determining if a transporter gene has been mutated. Experimental data as provided in 
Figure 1 indicates that the transporter proteins of the present invention are expressed in 
humans in the placenta, nervous system and brain detected by a virtual northern blot. In 
addition, PCR-based tissue screening panels indicate expression in hippocampus. 

Nucleic acid expression assays are useful for drug screening to identify compounds 
that modulate transporter nucleic acid expression. 

The invention thus provides a method for identifying a compound that can be used 
to treat a disorder associated with nucleic acid expression of the transporter gene, 
particularly biological and pathological processes that are mediated by the transporter in 
cells and tissues that express it. Experimental data as provided in Figure 1 indicates 
expression in humans in the placenta, nervous system, brain and hippocampus. The method 
typically includes assaying the ability of the compound to modulate the expression of the 
transporter nucleic acid and thus identifying a compound that can be used to treat a disorder 
characterized by undesired transporter nucleic acid expression. The assays can be 
performed in cell-based and cell-free systems. Cell-based assays include cells naturally 
expressing the transporter nucleic acid or recombinant cells genetically engineered to 
express specific nucleic acid sequences. 

The assay for transporter nucleic acid expression can involve direct assay of nucleic 
acid levels, such as mRNA levels, or on collateral compounds involved in the signal 
pathway. Further, the expression of genes that are up- or down-regulated in response to the 
transporter protein signal pathway can also be assayed. In this embodiment the regulatory 
regions of these genes can be operably linked to a reporter gene such as luciferase. 

Thus, modulators of transporter gene expression can be identified in a method 
wherein a cell is contacted with a candidate compound and the expression of mRNA 
determined. The level of expression of transporter mRNA in the presence of the candidate 



44 



CL001205 



compound is compared to the level of expression of transporter mRNA in the absence of the 
candidate compound. The candidate compound can then be identified as a modulator of 
nucleic acid expression based on this comparison and be used, for example to treat a 
disorder characterized by aberrant nucleic acid expression. When expression of mRNA is 
statistically significantly greater in the presence of the candidate compound than in its 
absence, the candidate compound is identified as a stimulator of nucleic acid expression. 
When nucleic acid expression is statistically significantly less in the presence of the 
candidate compound than in its absence, the candidate compound is identified as an 
inhibitor of nucleic acid expression. 

The invention further provides methods of treatment, with the nucleic acid as a 
target, using a compound identified through drug screening as a gene modulator to modulate 
transporter nucleic acid expression in cells and tissues that express the transporter. 
Experimental data as provided in Figure 1 indicates that the transporter proteins of the 
present invention are expressed in humans in the placenta, nervous system and brain 
detected by a virtual northern blot. In addition, PCR-based tissue screening panels 
indicate expression in hippocampus. Modulation includes both up-regulation (i.e. 
activation or agonization) or down-regulation (suppression or antagonization) or nucleic 
acid expression. 

Alternatively, a modulator for transporter nucleic acid expression can be a small 
molecule or drug identified using the screening assays described herein as long as the drug 
or small molecule inhibits the transporter nucleic acid expression in the cells and tissues that 
express the protein. Experimental data as provided in Figure 1 indicates expression in 
humans in the placenta, nervous system, brain and hippocampus. 

The nucleic acid molecules are also useful for monitoring the effectiveness of 
modulating compounds on the expression or activity of the transporter gene in clinical trials 
or in a treatment regimen. Thus, the gene expression pattern can serve as a barometer for 
the continuing effectiveness of treatment with the compound, particularly with compounds 
to which a patient can develop resistance. The gene expression pattern can also serve as a 
marker indicative of a physiological response of the affected cells to the compound. 
Accordingly, such monitoring would allow either increased administration of the compound 
or the administration of alternative compounds to which the patient has not become 
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resistant. Similarly, if the level of nucleic acid expression falls below a desirable level, 
administration of the compound could be commensurately decreased. 

The nucleic acid molecules are also useful in diagnostic assays for qualitative 
changes in transporter nucleic acid expression, and particularly in qualitative changes that 
lead to pathology. The nucleic acid molecules can be used to detect mutations in transporter 
genes and gene expression products such as mRNA. The nucleic acid molecules can be 
used as hybridization probes to detect naturally occurring genetic mutations in the 
transporter gene and thereby to determine whether a subject with the mutation is at risk for a 
disorder caused by the mutation. Mutations include deletion, addition, or substitution of one 
or more nucleotides in the gene, chromosomal rearrangement, such as inversion or 
transposition, modification of genomic DNA, such as aberrant methylation patterns or 
changes in gene copy number, such as amplification. Detection of a mutated form of the 
transporter gene associated with a dysfunction provides a diagnostic tool for an active 
disease or susceptibility to disease when the disease results from overexpression, 
underexpression, or altered expression of a transporter protein. 

Individuals carrying mutations in the transporter gene can be detected at the nucleic 
acid level by a variety of techniques. Figure 3 provides information on SNPs that have 
been identified in a gene encoding the transporter protein of the present invention. 33 1 
SNP variants were found, including 27 indels (indicated by a "-") and 1 SNP in exons. 
SNPs, identified at different nucleotide positions in introns and regions 5 5 and 3' of the 
ORF, may affect control/regulatory elements. As indicated by the data presented in 
Figure 3, the map position was determined to be on chromosome 5 by ePCR. Genomic 
DNA can be analyzed directly or can be amplified by using PCR prior to analysis. RNA or 
cDNA can be used in the same way. In some uses, detection of the mutation involves the 
use of a probe/primer in a polymerase chain reaction (PCR) (see, e.g. U.S. Patent Nos. 
4,683,195 and 4,683,202), such as anchor PCR or RACE PCR, or, alternatively, in a ligation 
chain reaction (LCR) (see, e.g., Landegran et aL, Science 247:1077-1080 (1988); and 
Nakazawa et aL, PNAS 97:360-364 (1994)), the latter of which can be particularly useful for 
detecting point mutations in the gene (see Abravaya et al, Nucleic Acids Res. 23:675-682 
(1995)). This method can include the steps of collecting a sample of cells from a patient, 
isolating nucleic acid (e.g., genomic, mRNA or both) from the cells of the sample, 
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contacting the nucleic acid sample with one or more primers which specifically hybridize to 
a gene under conditions such that hybridization and amplification of the gene (if present) 
occurs, and detecting the presence or absence of an amplification product, or detecting the 
size of the amplification product and comparing the length to a control sample. Deletions 
and insertions can be detected by a change in size of the amplified product compared to the 
normal genotype. Point mutations can be identified by hybridizing amplified DNA to 
normal RNA or antisense DNA sequences. 

Alternatively, mutations in a transporter gene can be directly identified, for example, 
by alterations in restriction enzyme digestion patterns determined by gel electrophoresis. 

Further, sequence-specific ribozymes (U.S. Patent No. 5,498,531) can be used to 
score for the presence of specific mutations by development or loss of a ribozyme cleavage 
site. Perfectly matched sequences can be distinguished from mismatched sequences by 
nuclease cleavage digestion assays or by differences in melting temperature. 

Sequence changes at specific locations can also be assessed by nuclease protection 
assays such as RNase and SI protection or the chemical cleavage method. Furthermore, 
sequence differences between a mutant transporter gene and a wild-type gene can be 
determined by direct DNA sequencing. A variety of automated sequencing procedures can 
be utilized when performing the diagnostic assays (Naeve, C.W., (1995) Biotechniques 
72:448), including sequencing by mass spectrometry (see, e.g., PCT International 
Publication No. WO 94/16101; Cohen etal,Adv, Chromatogr. 56:127-162 (1996); and 
Griffin etal,Appl Biochem. Biotechnol 55:147-159(1993)). 

Other methods for detecting mutations in the gene include methods in which 
protection from cleavage agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA duplexes (Myers et al 9 Science 230:1242 (1985)); Cotton et al, PNAS 55:4397 
(1 988); Saleeba et al , Meth Enzymol 21 7:286-295 (1 992)), electrophoretic mobility of 
mutant and wild type nucleic acid is compared (Orita et al , PNAS 86:2166 (1 989); Cotton et 
al, Mutat. Res. 255:125-144 (1993); and Hayashi et al, Genet Anal Tech Appl 9:13-19 
(1992)), and movement of mutant or wild-type fragments in polyacrylamide gels containing 
a gradient of denaturant is assayed using denaturing gradient gel electrophoresis (Myers et 
al, Nature 575:495 (1985)). Examples of other techniques for detecting point mutations 
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include selective oligonucleotide hybridization, selective amplification, and selective primer 
extension. 

The nucleic acid molecules are also useful for testing an individual for a genotype 
that while not necessarily causing the disease, nevertheless affects the treatment modality. 
Thus, the nucleic acid molecules can be used to study the relationship between an 
individual's genotype and the individual's response to a compound used for treatment 
(pharmacogenomic relationship). Accordingly, the nucleic acid molecules described herein 
can be used to assess the mutation content of the transporter gene in an individual in order to 
select an appropriate compound or dosage regimen for treatment. Figure 3 provides 
information on SNPs that have been identified in a gene encoding the transporter protein 
of the present invention. 331 SNP variants were found, including 27 indels (indicated by 
a "-") and 1 SNP in exons. SNPs, identified at different nucleotide positions in introns 
and regions 5 5 and 3' of the ORF, may affect control/regulatory elements. 

Thus nucleic acid molecules displaying genetic variations that affect treatment 
provide a diagnostic target that can be used to tailor treatment in an individual. 
Accordingly, the production of recombinant cells and animals containing these 
polymorphisms allow effective clinical design of treatment compounds and dosage 
regimens. 

The nucleic acid molecules are thus useful as antisense constructs to control 
transporter gene expression in cells, tissues, and organisms. A DNA antisense nucleic acid 
molecule is designed to be complementary to a region of the gene involved in transcription, 
preventing transcription and hence production of transporter protein. An antisense RNA or 
DNA nucleic acid molecule would hybridize to the mRNA and thus block translation of 
mRNA into transporter protein. 

Alternatively, a class of antisense molecules can be used to inactivate mRNA in 
order to decrease expression of transporter nucleic acid. Accordingly, these molecules can 
treat a disorder characterized by abnormal or undesired transporter nucleic acid expression. 
This technique involves cleavage by means of ribozymes containing nucleotide sequences 
complementary to one or more regions in the mRNA that attenuate the ability of the mRNA 
to be translated. Possible regions include coding regions and particularly coding regions 
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corresponding to the catalytic and other functional activities of the transporter protein, such 
as ligand binding. 

The nucleic acid molecules also provide vectors for gene therapy in patients 
containing cells that are aberrant in transporter gene expression. Thus, recombinant cells, 
which include the patient's cells that have been engineered ex vivo and returned to the 
patient, are introduced into an individual where the cells produce the desired transporter 
protein to treat the individual. 

The invention also encompasses kits for detecting the presence of a transporter 
nucleic acid in a biological sample. Experimental data as provided in Figure 1 indicates 
that the transporter proteins of the present invention are expressed in humans in the 
placenta, nervous system and brain detected by a virtual northern blot. In addition, PCR- 
based tissue screening panels indicate expression in hippocampus. For example, the kit 
can comprise reagents such as a labeled or labelable nucleic acid or agent capable of 
detecting transporter nucleic acid in a biological sample; means for determining the amount 
of transporter nucleic acid in the sample; and means for comparing the amount of 
transporter nucleic acid in the sample with a standard. The compound or agent can be 
packaged in a suitable container. The kit can further comprise instructions for using the kit 
to detect transporter protein mRNA or DNA. 

Nucleic Acid Arrays 

The present invention further provides nucleic acid detection kits, such as arrays 
or microarrays of nucleic acid molecules that are based on the sequence information 
provided in Figures 1 and 3 (SEQ ID NOS:l and 3). 

As used herein "Arrays" or "Microarrays" refers to an array of distinct 
polynucleotides or oligonucleotides synthesized on a substrate, such as paper, nylon or 
other type of membrane, filter, chip, glass slide, or any other suitable solid support. In 
one embodiment, the microarray is prepared and used according to the methods described 
in US Patent 5,837,832, Chee et al, PCT application W095/1 1995 (Chee et al), 
Lockhart, D. J. et al (1996; Nat. Biotech. 14: 1675-1680) and Schena, M. et al (1996; 
Proc. Natl. Acad. Sci. 93: 10614-10619), all of which are incorporated herein in their 
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entirety by reference. In other embodiments, such arrays are produced by the methods 
described by Brown et al. 9 US Patent No. 5,807,522. 

The microarray or detection kit is preferably composed of a large number of 
unique, single-stranded nucleic acid sequences, usually either synthetic antisense 
oligonucleotides or fragments of cDNAs, fixed to a solid support. The oligonucleotides 
are preferably about 6-60 nucleotides in length, more preferably 15-30 nucleotides in 
length, and most preferably about 20-25 nucleotides in length. For a certain type of 
microarray or detection kit, it may be preferable to use oligonucleotides that are only 7- 
20 nucleotides in length. The microarray or detection kit may contain oligonucleotides 
that cover the known 5 r , or 3', sequence, sequential oligonucleotides that cover the full 
length sequence; or unique oligonucleotides selected from particular areas along the 
length of the sequence. Polynucleotides used in the microarray or detection kit may be 
oligonucleotides that are specific to a gene or genes of interest. 

In order to produce oligonucleotides to a known sequence for a microarray or 
detection kit, the gene(s) of interest (or an ORF identified from the contigs of the present 
invention) is typically examined using a computer algorithm which starts at the 5 1 or at 
the 3' end of the nucleotide sequence. Typical algorithms will then identify oligomers of 
defined length that are unique to the gene, have a GC content within a range suitable for 
hybridization, and lack predicted secondary structure that may interfere with 
hybridization. In certain situations it may be appropriate to use pairs of oligonucleotides 
on a microarray or detection kit. The "pairs" will be identical, except for one nucleotide 
that preferably is located in the center of the sequence. The second oligonucleotide in the 
pair (mismatched by one) serves as a control. The number of oligonucleotide pairs may 
range from two to one million. The oligomers are synthesized at designated areas on a 
substrate using a light-directed chemical process. The substrate may be paper, nylon or 
other type of membrane, filter, chip, glass slide or any other suitable solid support. 

In another aspect, an oligonucleotide may be synthesized on the surface of the 
substrate by using a chemical coupling procedure and an ink jet application apparatus, as 
described in PCT application W095/251 1 16 (Baldeschweiler et al) which is incorporated 
herein in its entirety by reference. In another aspect, a "gridded" array analogous to a dot 
(or slot) blot may be used to arrange and link cDNA fragments or oligonucleotides to the 
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surface of a substrate using a vacuum system, thermal, UV, mechanical or chemical 
bonding procedures. An array, such as those described above, may be produced by hand 
or by using available devices (slot blot or dot blot apparatus), materials (any suitable 
solid support), and machines (including robotic instruments), and may contain 8, 24, 96, 
384, 1536, 6144 or more oligonucleotides, or any other number between two and one 
million which lends itself to the efficient use of commercially available instrumentation. 

In order to conduct sample analysis using a microarray or detection kit, the RNA 
or DNA from a biological sample is made into hybridization probes. The mRNA is 
isolated, and cDNA is produced and used as a template to make antisense RNA (aRNA). 
The aRNA is amplified in the presence of fluorescent nucleotides, and labeled probes are 
incubated with the microarray or detection kit so that the probe sequences hybridize to 
complementary oligonucleotides of the microarray or detection kit. Incubation conditions 
are adjusted so that hybridization occurs with precise complementary matches or with 
various degrees of less complementarity. After removal of nonhybridized probes, a 
scanner is used to determine the levels and patterns of fluorescence. The scanned images 
are examined to determine degree of complementarity and the relative abundance of each 
oligonucleotide sequence on the microarray or detection kit. The biological samples may 
be obtained from any bodily fluids (such as blood, urine, saliva, phlegm, gastric juices, 
etc.), cultured cells, biopsies, or other tissue preparations. A detection system may be 
used to measure the absence, presence, and amount of hybridization for all of the distinct 
sequences simultaneously. This data may be used for large-scale correlation studies on 
the sequences, expression patterns, mutations, variants, or polymorphisms among 
samples. 

Using such arrays, the present invention provides methods to identify the 
expression of the transporter proteins/peptides of the present invention. In detail, such 
methods comprise incubating a test sample with one or more nucleic acid molecules and 
assaying for binding of the nucleic acid molecule with components within the test 
sample. Such assays will typically involve arrays comprising many genes, at least one of 
which is a gene of the present invention and or alleles of the transporter gene of the 
present invention. Figure 3 provides information on SNPs that have been identified in a 
gene encoding the transporter protein of the present invention. 331 SNP variants were 
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found, including 27 indels (indicated by a "-") and 1 SNP in exons. SNPs, identified at 
different nucleotide positions in introns and regions 5' and 3 5 of the ORF, may affect 
control/regulatory elements. 

Conditions for incubating a nucleic acid molecule with a test sample vary. 
Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid molecule used in the assay. One 
skilled in the art will recognize that any one of the commonly available hybridization, 
amplification or array assay formats can readily be adapted to employ the novel 
fragments of the Human genome disclosed herein. Examples of such assays can be found 
in Chard, T, An Introduction to Radioimmunoassay and Related Techniques, Elsevier 
Science Publishers, Amsterdam, The Netherlands (1986); Bullock, G. R. et al y 
Techniques in Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1 982), Vol. 2 
(1983), Vol. 3 (1985); Tijssen, P., Practice and Theory of Enzyme Immunoassays: 
Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science 
Publishers, Amsterdam, The Netherlands (1985). 

The test samples of the present invention include cells, protein or membrane 
extracts of cells. The test sample used in the above-described method will vary based on 
the assay format, nature of the detection method and the tissues, cells or extracts used as 
the sample to be assayed. Methods for preparing nucleic acid extracts or of cells are well 
known in the art and can be readily be adapted in order to obtain a sample that is 
compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain 
the necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in close 
confinement, one or more containers which comprises: (a) a first container comprising 
one of the nucleic acid molecules that can bind to a fragment of the Human genome 
disclosed herein; and (b) one or more other containers comprising one or more of the 
following: wash reagents, reagents capable of detecting presence of a bound nucleic acid. 

In detail, a compartmentalized kit includes any kit in which reagents are contained 
in separate containers. Such containers include small glass containers, plastic containers, 
strips of plastic, glass or paper, or arraying material such as silica. Such containers 
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allows one to efficiently transfer reagents from one compartment to another compartment 
such that the samples and reagents are not cross-contaminated, and the agents or solutions 
of each container can be added in a quantitative fashion from one compartment to 
another. Such containers will include a container which will accept the test sample, a 
container which contains the nucleic acid probe, containers which contain wash reagents 
(such as phosphate buffered saline, Tris-buffers, etc.), and containers which contain the 
reagents used to detect the bound probe. One skilled in the art will readily recognize that 
the previously unidentified transporter gene of the present invention can be routinely 
identified using the sequence information disclosed herein can be readily incorporated 
into one of the established kit formats which are well known in the art, particularly 
expression arrays. 

Vectors/host cells 

The invention also provides vectors containing the nucleic acid molecules described 
herein. The term "vector" refers to a vehicle, preferably a nucleic acid molecule, which can 
transport the nucleic acid molecules. When the vector is a nucleic acid molecule, the 
nucleic acid molecules are covalently linked to the vector nucleic acid. With this aspect of 
the invention, the vector includes a plasmid, single or double stranded phage, a single or 
double stranded RNA or DNA viral vector, or artificial chromosome, such as a BAC, PAC, 
YAC, OR MAC. 

A vector can be maintained in the host cell as an extrachromosomal element where it 
replicates and produces additional copies of the nucleic acid molecules. Alternatively, the 
vector may integrate into the host cell genome and produce additional copies of the nucleic 
acid molecules when the host cell replicates. 

The invention provides vectors for the maintenance (cloning vectors) or vectors for 
expression (expression vectors) of the nucleic acid molecules. The vectors can function in 
procaryotic or eukaryotic cells or in both (shuttle vectors). 

Expression vectors contain cis-acting regulatory regions that are operably linked in 
the vector to the nucleic acid molecules such that transcription of the nucleic acid molecules 
is allowed in a host cell The nucleic acid molecules can be introduced into the host cell 
with a separate nucleic acid molecule capable of affecting transcription. Thus, the second 
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nucleic acid molecule may provide a trans-acting factor interacting with the cis-regulatory 
control region to allow transcription of the nucleic acid molecules from the vector. 
Alternatively, a trans-acting factor may be supplied by the host cell. Finally, a trans-acting 
factor can be produced from the vector itself. It is understood, however, that in some 
embodiments, transcription and/or translation of the nucleic acid molecules can occur in a 
cell-free system. 

The regulatory sequence to which the nucleic acid molecules described herein can 
be operably linked include promoters for directing mRNA transcription. These include, but 
are not limited to, the left promoter from bacteriophage X, the lac, TRP, and TAC promoters 
from E. coli 9 the early and late promoters from SV40, the CMV immediate early promoter, 
the adenovirus early and late promoters, and retrovirus long-terminal repeats. 

In addition to control regions that promote transcription, expression vectors may also 
include regions that modulate transcription, such as repressor binding sites and enhancers. 
Examples include the SV40 enhancer, the cytomegalovirus immediate early enhancer, 
polyoma enhancer, adenovirus enhancers, and retrovirus LTR enhancers. 

In addition to containing sites for transcription initiation and control, expression 
vectors can also contain sequences necessary for transcription termination and, in the 
transcribed region a ribosome binding site for translation. Other regulatory control elements 
for expression include initiation and termination codons as well as polyadenylation signals. 
The person of ordinary skill in the art would be aware of the numerous regulatory sequences 
that are useful in expression vectors. Such regulatory sequences are described, for example, 
in Sambrook et al 9 Molecular Cloning: A Laboratory Manual 2nd ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY, (1989). 

A variety of expression vectors can be used to express a nucleic acid molecule. 
Such vectors include chromosomal, episomal, and virus-derived vectors, for example 
vectors derived from bacterial plasmids, from bacteriophage, from yeast episomes, from 
yeast chromosomal elements, including yeast artificial chromosomes, from viruses such as 
baculoviruses, papovaviruses such as SV40, Vaccinia viruses, adenoviruses, poxviruses, 
pseudorabies viruses, and retroviruses. Vectors may also be derived from combinations of 
these sources such as those derived from plasmid and bacteriophage genetic elements, e.g. 
cosmids and phagemids. Appropriate cloning and expression vectors for prokaryotic and 
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eukaryotic hosts are described in Sambrook et al, Molecular Cloning: A Laboratory 
Manual. 2nd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, (1989). 

The regulatory sequence may provide constitutive expression in one or more host 
cells (i.e. tissue specific) or may provide for inducible expression in one or more cell types 
such as by temperature, nutrient additive, or exogenous factor such as a hormone or other 
ligand. A variety of vectors providing for constitutive and inducible expression in 
prokaryotic and eukaryotic hosts are well known to those of ordinary skill in the art. 

The nucleic acid molecules can be inserted into the vector nucleic acid by well- 
known methodology. Generally, the DNA sequence that will ultimately be expressed is 
joined to an expression vector by cleaving the DNA sequence and the expression vector 
with one or more restriction enzymes and then ligating the fragments together. Procedures 
for restriction enzyme digestion and ligation are well known to those of ordinary skill in the 
art. 

The vector containing the appropriate nucleic acid molecule can be introduced into 
an appropriate host cell for propagation or expression using well-known techniques. 
Bacterial cells include, but are not limited to, E. coli, Streptomyces, and Salmonella 
typhimurium. Eukaryotic cells include, but are not limited to, yeast, insect cells such as 
Drosophila, animal cells such as COS and CHO cells, and plant cells. 

As described herein, it may be desirable to express the peptide as a fusion protein. 
Accordingly, the invention provides fusion vectors that allow for the production of the 
peptides. Fusion vectors can increase the expression of a recombinant protein, increase the 
solubility of the recombinant protein, and aid in the purification of the protein by acting for 
example as a ligand for affinity purification. A proteolytic cleavage site may be introduced 
at the junction of the fusion moiety so that the desired peptide can ultimately be separated 
from the fusion moiety. Proteolytic enzymes include, but are not limited to, factor Xa, 
thrombin, and enterotransporter. Typical fusion expression vectors include pGEX (Smith et 
aL, Gene 67:31-40 (1988)), pMAL (New England Biolabs, Beverly, MA) and pRIT5 
(Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose E binding 
protein, or protein A, respectively, to the target recombinant protein. Examples of suitable 
inducible non-fusion E. coli expression vectors include pTrc (Amann et aL, Gene 69:301- 
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315 (1 988)) and pET lid (Studier et al, Gene Expression Technology: Methods in 
Enzymology 755:60-89 (1990)). 

Recombinant protein expression can be maximized in host bacteria by providing a 
genetic background wherein the host cell has an impaired capacity to proteolytically cleave 
the recombinant protein. (Gottesman, S., Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, California (1990) 1 19-128). Alternatively, 
the sequence of the nucleic acid molecule of interest can be altered to provide preferential 
codon usage for a specific host cell, for example E. coll (Wada et al , Nucleic Acids Res. 
20:2111-2118(1992)). 

The nucleic acid molecules can also be expressed by expression vectors that are 
operative in yeast. Examples of vectors for expression in yeast e.g., 5. cerevisiae include 
pYepSecl (Baldari, et al^EMBOl 6:229-234 (1987)), pMFa (Kurjan etaL, Cell 30:933- 
943(1982)), pJRY88 (Schultz etaL, Gene 54:113-123 (1987)), and pYES2 (Invitrogen 
Corporation, San Diego, CA). 

The nucleic acid molecules can also be expressed in insect cells using, for example, 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in 
cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al 9 MoL Cell Biol 
3:2156-2165 (1983)) and the pVL series (Lucklowe/ a/., Virology 770:31-39(1989)). 

In certain embodiments of the invention, the nucleic acid molecules described herein 
are expressed in mammalian cells using mammalian expression vectors. Examples of 
mammalian expression vectors include pCDM8 (Seed, B. Nature 529:840(1987)) and 
pMT2PC (Kaufman et al, EMBOJ. 6:187-195 (1987)). 

The expression vectors listed herein are provided by way of example only of the 
well-known vectors available to those of ordinary skill in the art that would be useful to 
express the nucleic acid molecules. The person of ordinary skill in the art would be aware 
of other vectors suitable for maintenance propagation or expression of the nucleic acid 
molecules described herein. These are found for example in Sambrook, J., Fritsh, E. F., and 
Maniatis, T. Molecular Cloning: A Laboratory Manual 2nd, ed, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989. 

The invention also encompasses vectors in which the nucleic acid sequences 
described herein are cloned into the vector in reverse orientation, but operably linked to a 
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regulatory sequence that permits transcription of antisense RNA. Thus, an antisense 
transcript can be produced to all, or to a portion, of the nucleic acid molecule sequences 
described herein, including both coding and non-coding regions. Expression of this 
antisense RNA is subject to each of the parameters described above in relation to expression 
of the sense RNA (regulatory sequences, constitutive or inducible expression, tissue-specific 
expression). 

The invention also relates to recombinant host cells containing the vectors described 
herein. Host cells therefore include prokaryotic cells, lower eukaryotic cells such as yeast, 
other eukaryotic cells such as insect cells, and higher eukaryotic cells such as mammalian 
cells. 

The recombinant host cells are prepared by introducing the vector constructs 
described herein into the cells by techniques readily available to the person of ordinary skill 
in the art. These include, but are not limited to, calcium phosphate transfection, DEAE- 
dextran-mediated transfection, cationic lipid-mediated transfection, electroporation, 
transduction, infection, lipofection, and other techniques such as those found in Sambrook, 
etal {Molecular Cloning: A Laboratory Manual 2nd, ed, Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). 

Host cells can contain more than one vector. Thus, different nucleotide sequences 
can be introduced on different vectors of the same cell. Similarly, the nucleic acid 
molecules can be introduced either alone or with other nucleic acid molecules that are not 
related to the nucleic acid molecules such as those providing trans-acting factors for 
expression vectors. When more than one vector is introduced into a cell, the vectors can be 
introduced independently, co-introduced or joined to the nucleic acid molecule vector. 

In the case of bacteriophage and viral vectors, these can be introduced into cells as 
packaged or encapsulated virus by standard procedures for infection and transduction. Viral 
vectors can be replication-competent or replication-defective. In the case in which viral 
replication is defective, replication will occur in host cells providing functions that 
complement the defects. 

Vectors generally include selectable markers that enable the selection of the 
subpopulation of cells that contain the recombinant vector constructs. The marker can be 
contained in the same vector that contains the nucleic acid molecules described herein or 
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may be on a separate vector. Markers include tetracycline or ampicillin-resistance genes for 
prokaryotic host cells and dihydrofolate reductase or neomycin resistance for eukaryotic 
host cells. However, any marker that provides selection for a phenotypic trait will be 
effective. 

While the mature proteins can be produced in bacteria, yeast, mammalian cells, and 
other cells under the control of the appropriate regulatory sequences, cell- free transcription 
and translation systems can also be used to produce these proteins using RNA derived from 
the DNA constructs described herein. 

Where secretion of the peptide is desired, which is difficult to achieve with multi- 
transmembrane domain containing proteins such as transporters, appropriate secretion 
signals are incorporated into the vector. The signal sequence can be endogenous to the 
peptides or heterologous to these peptides. 

Where the peptide is not secreted into the medium, which is typically the case with 
transporters, the protein can be isolated from the host cell by standard disruption procedures, 
including freeze thaw, sonication, mechanical disruption, use of lysing agents and the like. 
The peptide can then be recovered and purified by well-known purification methods 
including ammonium sulfate precipitation, acid extraction, anion or cationic exchange 
chromatography, phosphocellulose chromatography, hydrophobic-interaction 
chromatography, affinity chromatography, hydroxylapatite chromatography, lectin 
chromatography, or high performance liquid chromatography. 

It is also understood that depending upon the host cell in recombinant production of 
the peptides described herein, the peptides can have various glycosylation patterns, 
depending upon the cell, or maybe non-glycosylated as when produced in bacteria. In 
addition, the peptides may include an initial modified methionine in some cases as a result 
of a host-mediated process. 

Uses of vectors and host cells 

The recombinant host cells expressing the peptides described herein have a variety 
of uses. First, the cells are useful for producing a transporter protein or peptide that can be 
further purified to produce desired amounts of transporter protein or fragments. Thus, host 
cells containing expression vectors are useful for peptide production. 
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Host cells are also useful for conducting cell-based assays involving the transporter 
protein or transporter protein fragments, such as those described above as well as other 
formats known in the art. Thus, a recombinant host cell expressing a native transporter 
protein is useful for assaying compounds that stimulate or inhibit transporter protein 
function. 

Host cells are also useful for identifying transporter protein mutants in which these 
functions are affected. If the mutants naturally occur and give rise to a pathology, host cells 
containing the mutations are useful to assay compounds that have a desired effect on the 
mutant transporter protein (for example, stimulating or inhibiting function) which may not 
be indicated by their effect on the native transporter protein. 

Genetically engineered host cells can be further used to produce non-human 
transgenic animals. A transgenic animal is preferably a mammal, for example a rodent, such 
as a rat or mouse, in which one or more of the cells of the animal include a transgene. A 
transgene is exogenous DNA that is integrated into the genome of a cell from which a 
transgenic animal develops and which remains in the genome of the mature animal in one or 
more cell types or tissues of the transgenic animal. These animals are useful for studying 
the function of a transporter protein and identifying and evaluating modulators of transporter 
protein activity. Other examples of transgenic animals include non-human primates, sheep, 
dogs, cows, goats, chickens, and amphibians. 

A transgenic animal can be produced by introducing nucleic acid into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster animal. Any of the transporter protein 
nucleotide sequences can be introduced as a transgene into the genome of a non-human 
animal, such as a mouse. 

Any of the regulatory or other sequences useful in expression vectors can form part 
of the transgenic sequence. This includes intronic sequences and polyadenylation signals, if 
not already included. A tissue-specific regulatory sequence(s) can be operably linked to the 
transgene to direct expression of the transporter protein to particular cells. 

Methods for generating transgenic animals via embryo manipulation and 
microinjection, particularly animals such as mice, have become conventional in the art and 
are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, both by Leder et 
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al, U.S. Patent No. 4,873,191 by Wagner et al and in Hogan, B., Manipulating the Mouse 
Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986). Similar 
methods are used for production of other transgenic animals. A transgenic founder animal 
can be identified based upon the presence of the transgene in its genome and/or expression 
of transgenic mRNA in tissues or cells of the animals. A transgenic founder animal can then 
be used to breed additional animals carrying the transgene. Moreover, transgenic animals 
carrying a transgene can further be bred to other transgenic animals carrying other 
transgenes. A transgenic animal also includes animals in which the entire animal or tissues 
in the animal have been produced using the homologously recombinant host cells described 
herein. 

In another embodiment, transgenic non-human animals can be produced which 
contain selected systems that allow for regulated expression of the transgene. One example 
of such a system is the cre/loxP recombinase system of bacteriophage PI . For a description 
of the cre/loxP recombinase system, see, e.g., Lakso et al PNAS 89:6232-6236 (1992). 
Another example of a recombinase system is the FLP recombinase system of S. cerevisiae 
(O'Gorman et al Science 257:1351-1355 (1991). If a cre/loxP recombinase system is used 
to regulate expression of the transgene, animals containing transgenes encoding both the Cre 
recombinase and a selected protein is required. Such animals can be provided through the 
construction of "double" transgenic animals, e.g., by mating two transgenic animals, one 
containing a transgene encoding a selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals described herein can also be produced 
according to the methods described in Wilmut, I. et al Nature 355:810-813 (1997) and PCT 
International Publication Nos. WO 97/07668 and WO 97/07669. In brief, a cell, e.g., a 
somatic cell, from the transgenic animal can be isolated and induced to exit the growth cycle 
and enter G 0 phase. The quiescent cell can then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the same species from which the 
quiescent cell is isolated. The reconstructed oocyte is then cultured such that it develops to 
morula or blastocyst and then transferred to pseudopregnant female foster animal. The 
offspring born of this female foster animal will be a clone of the animal from which the cell, 
e.g., the somatic cell, is isolated. 
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Transgenic animals containing recombinant cells that express the peptides described 
herein are useful to conduct the assays described herein in an in vivo context. Accordingly, 
the various physiological factors that are present in vivo and that could effect ligand binding, 
transporter protein activation, and signal transduction, may not be evident from in vitro cell- 
free or cell-based assays. Accordingly, it is useful to provide non-human transgenic animals 
to assay in vivo transporter protein function, including ligand interaction, the effect of 
specific mutant transporter proteins on transporter protein function and ligand interaction, 
and the effect of chimeric transporter proteins. It is also possible to assess the effect of null 
mutations, that is mutations that substantially or completely eliminate one or more 
transporter protein functions. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method 
and system of the invention will be apparent to those skilled in the art without departing 
from the scope and spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood that the 
invention as claimed should not be unduly limited to such specific embodiments. Indeed, 
various modifications of the above-described modes for carrying out the invention which 
are obvious to those skilled in the field of molecular biology or related fields are intended 
to be within the scope of the following claims. 
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SEQUENCE LISTING 



<110> YAN, Chunhua et al 



<120> ISOLATED HUMAN TRANSPORTER PROTEINS, 

NUCLEIC ACID MOLECULES ENCODING HUMAN TRANSPORTER PROTEINS, 
AND USES THEREOF 



<130> CL001205 



<160> 4 



<170> FastSEQ for Windows Version 4.0 



<210> 1 
<211> 2434 
<212> DNA 
<213> Human 



<400> 1 

tgcagtttaa gtgttcggat tccaagggaa 
agcaagcaag gaaggaactg caggaggaaa 
ggaaaggggg ggaaacacca aatctatgat 
aaaggaatat gcagcacatt tttgccttct 
gtgccaattt ccccaacaat atccagatcg 
aacatgctgc ttttagattt gctttgtcgc 
agattgatat tgtgaacatc agcgacagct 
tctccaaagg agtctatgcc atctttgggt 
cctccttttg tggggccctc cacgtctgct 
ccaatcagtt tgtccttcag ctgcgccctg 
accattacaa gtggcagaaa tttgtctaca 
tgcagaaagt cctggataca gctgctgaga 
tgacaaccac agaggaggga taccggatgc 
ggctggtggt ggtggactgt gaatcagaac 
agctagagaa gaatggcatc ggctaccact 
ttgacttaaa caaattcaag gagagtggcg 
acacagacac tattccggcc aagatcatgc 
acacacgggt ggactggaag agacccaagt 
aggtgatggc tgaggctttc cagagcctgc 
ggaatgctgg ggattgtctg gctaacccag 
agagagctct gcagcaggtg cgatttgaag 
aaggacgccg gaccaactac acgctccacg 
agattggtta ctggaatgaa gatgataagt 
ggggcgataa ttcaagtgtt cagaacagaa 
cttatgtgat gctcaagaag aacgccaatc 
actgtgtaga gctggcggca gagattgcca 
ttgtcagtga tggaaaatac ggagcccgag 
tgggagagct ggtctatgga gtaagttcac 
gagggtttga caggaaatca tttggtggtt 
ccctgtaatt gagtgttgtt gctgctgaac 
gacactcttt gttcaggtta ctggtcccag 
gtcagttgat tgtcgtccaa cacaggtgac 
cagactgctg agttgtctat acaaagtcac 
tgatattaaa tgcatctgga gccaacctaa 
gctctctcat gaaatttcca attcagtcaa 



acagacaaac ctcacgaaag gaaggaagca 60 
agaacaggca gaacagcgag aagaataaag 120 
tggacctggg cttctttttc gccaatgcaa 180 
tctgcaccgg tttcctaggc gcggtagtag 240 
ggggattatt tccaaaccag cagtcacagg 300 
aactcacaga gcccccgaag ctgctccccc 360 
ttgagatgac ctatagattc tgttcccagt 420 
tttatgaacg taggactgtc aacatgctga 480 
tcattacgcc gagctttccc gttgatacat 540 
aactgcagga tgccctcatc agcatcattg 600 
tttatgatgc cgaccggggc ttatccgtcc 660 
agaactggca ggtgacagca gtcaacattt 720 
tctttcagga cctggagaag aaaaaggagc 780 
gcctcaatgc tatcttgggc cagattataa 840 
acattcttgc aaatctgggc ttcatggaca 900 
ccaatgtgac aggtttccag ctggtgaact 960 
agcagtggaa gaatagtgat gctcgagacc 1020 
acacctctgc gctcacctac gatggggtga 1080 
ggaggcagag aattgatata tctcgccggg 1140 
ctgttccctg gggccaaggg atcgacatcc 1200 
gtttaacagg aaacgtgcag tttaatgaga 1260 
tgattgaaat gaaacatgac ggcatccgaa 1320 
ttgtccctgc agccaccgat gcccaagctg 1380 
catacatcgt cacaacaatc ctagaagatc 1440 
agtttgaggg caatgaccgt tacgagggct 1500 
agcacgtggg ctactcctac cgtctggaga 1560 
accctgacac gaaggcctgg aatggcatgg 1620 
tgcagggtgg gaaattagag ggcggaggca 1680 
gggtggccct gcccacagat gtctatgaaa 1740 
agatgagtca tccaaaatcc aatttcttca 1800 
gtccctcaat cccactcaga gtcttgtgac 18 60 
agcatagctc caagatcaat tttcttgagg 1920 
ttgtggctct ctcagtatca gtttcttctc 1980 
ctttctagtt acttgcctct ctagtttcat 2040 
atgcccctta attactctgt tccctagagt 2100 
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gctcccttcc actctccacc cctaagatac tactccttca aaacctatat caaataatac 2160 
ttttttcagg gtgtgtttct ttctttcttc tcataatagg tatgaatgtg ccttttaatt 2220 
gttctcgcct tcccctatag aatttagttg ctggtttttt ttaatggttt accctgcctt 2280 
atataacggt tacctgtgta acaggggtag gactattcta tctttatagt gctcaccaca 2340 
cttgaaataa ctccatgcac aattgctata aaatcttcaa taaattacag cagttttgaa 2400 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 2434 

<210> 2 
<211> 519 
<212> PRT 
<213> Human 

<400> 2 

Met Gin His He Phe Ala Phe Phe Cys Thr Gly Phe Leu Gly Ala Val 

15 10 15 

Val Gly Ala Asn Phe Pro Asn Asn He Gin He Gly Gly Leu Phe Pro 

20 25 30 

Asn Gin Gin Ser Gin Glu His Ala Ala Phe Arg Phe Ala Leu Ser Gin 

35 40 45 

Leu Thr Glu Pro Pro Lys Leu Leu Pro Gin He Asp He Val Asn He 

50 55 60 

Ser Asp Ser Phe Glu Met Thr Tyr Arg Phe Cys Ser Gin Phe Ser Lys 
65 70 75 80 

Gly Val Tyr Ala He Phe Gly Phe Tyr Glu Arg Arg Thr Val Asn Met 

85 90 95 

Leu Thr Ser Phe Cys Gly Ala Leu His Val Cys Phe He Thr Pro Ser 

100 105 110 

Phe Pro Val Asp Thr Ser Asn Gin Phe Val Leu Gin Leu Arg Pro Glu 

115 120 125 

Leu Gin Asp Ala Leu He Ser He He Asp His Tyr Lys Trp Gin Lys 

130 135 140 

Phe Val Tyr He Tyr Asp Ala Asp Arg Gly Leu Ser Val Leu Gin Lys 
145 150 155 160 

Val Leu Asp Thr Ala Ala Glu Lys Asn Trp Gin Val Thr Ala Val Asn 

165 170 175 

He Leu Thr Thr Thr Glu Glu Gly Tyr Arg Met Leu Phe Gin Asp Leu 

180 185 190 

Glu Lys Lys Lys Glu Arg Leu Val Val Val Asp Cys Glu Ser Glu Arg 

195 200 205 

Leu Asn Ala He Leu Gly Gin He He Lys Leu Glu Lys Asn Gly He 

210 215 220 

Gly Tyr His Tyr He Leu Ala Asn Leu Gly Phe Met Asp He Asp Leu 
225 230 235 240 

Asn Lys Phe Lys Glu Ser Gly Ala Asn Val Thr Gly Phe Gin Leu Val 

245 250 255 

Asn Tyr Thr Asp Thr He Pro Ala Lys He Met Gin Gin Trp Lys Asn 

260 265 270 

Ser Asp Ala Arg Asp His Thr Arg Val Asp Trp Lys Arg Pro Lys Tyr 

275 280 285 

Thr Ser Ala Leu Thr Tyr Asp Gly Val Lys Val Met Ala Glu Ala Phe 

290 295 300 

Gin Ser Leu Arg Arg Gin Arg He Asp He Ser Arg Arg Gly Asn Ala 
305 310 315 320 

Gly Asp Cys Leu Ala Asn Pro Ala Val Pro Trp Gly Gin Gly He Asp 

325 330 335 

He Gin Arg Ala Leu Gin Gin Val Arg Phe Glu Gly Leu Thr Gly Asn 
340 345 350 
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Val 


Gin 


Phe 


Asn 






355 




He 


Glu 


Met 


Lys 




370 






Asp 


Asp 


Lys 


Phe 


385 








Asn 


Ser 


Ser 


Val 


Asp 


Pro 


Tvr 


Val 








420 


Asp 


Arg 


Tyr 


Glu 






435 




His 


Val 


Glv 


Tvr 




450 






Gly 


Ala 


Arg 


Asp 


4 VJ -J 








Leu 


Val 


Tyr 


Gly 


Gly 


Arg 


Gly 


Phe 








500 


Thr 


Asp 


Val 


Tyr 






515 





Glu Lys Gly Arg 
360 

His Asp Gly He 
375 

Val Pro Ala Ala 
390 

Gin Asn Arg Thr 
405 

Met Leu Lys Lys 

Gly Tyr Cys Val 
440 

Ser Tyr Arg Leu 
455 

Pro Asp Thr Lys 
470 

Val Ser Ser Leu 
485 

Asp Arg Lys Ser 
Glu Thr Leu 



Arg Thr Asn Tyr 

Arg Lys He Gly 
380 

Thr Asp Ala Gin 
395 

Tyr He Val Thr 
410 

Asn Ala Asn Gin 
425 

Glu Leu Ala Ala 

Glu He Val Ser 
460 

Ala Trp Asn Gly 
475 

Gin Gly Gly Lys 
4 90 

Phe Gly Gly Trp 
505 



Thr Leu His Val 
365 

Tyr Trp Asn Glu 

Ala Gly Gly Asp 
400 

Thr He Leu Glu 
415 

Phe Glu Gly Asn 
430 

Glu He Ala Lys 
445 

Asp Gly Lys Tyr 

Met Val Gly Glu 
480 

Leu Glu Gly Gly 
495 

Val Ala Leu Pro 
510 



<210> 3 
<211> 213456 
<212> DNA 
<213> Human 

<220> 

<221> misc_f eature 
<222> (1) . . . (213456) 
<223> n = A,T,C or G 

<400> 3 

ttcaaagtag taagatacaa aaactgaaaa tggagaaatt aaaagatgga atcattaaat 60 
cagagactct taatgttggc agagacctta gaagtgatct ttcctgattc tgttctctca 120 
tcttcctatg ctgatgctta ttcttccttt agaaggttcc tacccagtgc ttgctgttat 180 
tagagcctaa gagttaagag ctggggcctg atggtggcat ttactgtgtt taaatgccca 240 
atctctttca ttagcagtga aaacttgggg tgagttttta gacctctgtt tcttcatcta 300 
taaaatgggc atgaggctac ctacacaggt attgtacaaa taaatgatgc aggtgcattt 360 
cttagcatgg tacctggcac attcaaaata gtgaataaac attagcctta aaaattattt 420 
tagctaatta attaatatat gtgaaagtgc ataggaaatc acaaagcatt tgacctccaa 480 
gttgttccta ttttggggct tcattgtaca tggtcagata ttttttacaa atcccagtcc 540 
ttgtagctgt cactcaagtc cacatgatat acagtcctgg tgagggaagt gaagagtctg 600 
tggataaaat ggcctattca tatggcaaag caaaatggag tccaaagtct attatttctt 660 
tcttactaaa gacaaagttg actttccttt tcctggatca tctcctgttg gacaggagtc 720 
ctggccaggg cagtaagtca actggtaacg gccaaagcac tcaagtttgt aggttgtact 780 
gtacatctta gtaggtctgt gtctttatga cttaatccca gaagtcttat ctttttcatc 840 
tgtaaaacag ggctcatggc cactttacag gtttggagtg aagatagatg gataaaaatc 900 
tgtaaagttc cttcaaaaac tgtattctgt ggcggctagt ggctgctgct ttgcatttaa 960 
cactactatc ctgaacacag ctgatctctc tacaggttat ttcagcagaa gccctggtct 1020 
aggagagtct atgaagttcc agcttacacc agacatgacc agcatccaga aagacctgaa 1080 
agaagcttga atcctctcac tagaatccct gcaaaatgac tcatgtaatt gctctgtgta 1140 
agtatcctta gtctttattg tacacccaca cgattctgat gctatagact cctgtggaat 1200 
gcagggaaag agagaagggg gcccatttta aatgcctagg attgaaaaga gaccaccgtt 1260 
tcacttgtaa aggtagacag ggactgtcaa atacctggtc aaaatacctg ccagtcactc 1320 
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cagatcctcc cttgtttgtc tatctgtcat tccttccatt aggagagaga aagctttttt 1380 
ttttttttcc tttaaatttc ctaggaggga tttctagggt ctttccctca ggaattagtt 1440 
gtaggaataa ttgggccagt ggagtgcagg agatatatcc agcgcagccc atgcactcct 1500 
agaaaaagtg acctagatca agcagctggt ggattgagga ctattgtggg gaccccctgc 1560 
cacctactga cttacagctg aacccacatt cccagcagct tcagcctggg ggctggggga 1620 
gcgggcagac cgagctcaga aaggcagggg agggtaaaga ggactgtggg gttgcccctt 1680 
tcaggaccaa gtgccacgtg tcacacaccc ccacctccac ctttctgcac acacagaaag 1740 
gaggataagg tgaggatggg aggaaggggg aacaggtagg gaggtcggct gtggaactcc 1800 
aagctagctc ggtgggtatt agcatagagc ttgctgcctg tgtgagtgtg agggggagag 1860 
cgagagagag caagggaggg agagagaggc aggctgcgag gggagaggag agggagtggg 1920 
ggagccagcg ctccagctag catgaggacg ggcttctttt cccgtgctca gttaatctgg 1980 
ctgtcagttg gtgttaacgc tgcagtttaa gtgttcggat tccaagggaa acagacaaac 2040 
ctcacgaaag gaaggaagca agcaagcaag gaaggaactg caggaggaaa agaacaggca 2100 
gaacagcgag aagaataaag ggaaaggggg ggaaacacca aatctatgat tggacctggg 2160 
cttctttttc gccaatgcaa aaaggaatat gcagcacatt tttgccttct tctgcaccgg 2220 
tttcctaggc gcggtagtag gtgccaattt ccccaacaat atccagatcg gtgagtgagg 2280 
gggcagcctg gggagggact ttctgggtct ggccagggat ttttttgggg ataggggttg 2340 
tgtaccccct ccccggtact gactgttttg cttggctccc taaagctgtg ctgcggtaac 2400 
agaagggaga cttgggctta cagccagagg agggggcttc tcctgatcgg atgaggggca 24 60 
gaggggaagt gttcacacac gcacacatac cctactcgca ctccaggcaa gagcatgtga 2520 
aatggaggaa ccatcgcttt ggaggaaaaa aaaaatcagg ctggaaaggg tggtgggtgt 2580 
ttaaggagtt aactctattg cttggtagat ggtgcttgat tccattttta atgtaagtat 2640 
gtatggtgtg tgtgttttca cgtgtgtgat tatatattac atatgcacat atatatgtaa 2700 
ttgaaggagg cagtgctttc tctgctgggg gacagaaaag agaccctcga gaagaaggag 2760 
tgaggggtgc tgggtatatt gcagccactg aaataatgcc agaaggcccc cactccaagg 2820 
cgggtaggct ccctctcctg ttctggactc ctccagctgc cttctctttg ctgtctgcca 2880 
tgctgcgctg gtggtctcca ctcccccgat cctggaactt cctcgcctgc cttttctccg 2940 
ttccttcctt gccccctcct cttagatttc ctatccacag aggtctacct tttacacaca 3000 
caaacacaca cacgtacaca tgctcctttc tcctcctgtt ggctctccat tatccttgtt 3060 
actgggctcc atcctctcaa cttggaggca ggtttcaaca tgctgcatgc tttttttgtt 3120 
ccccatttcc ccttcttagt tgttacactt ctcaaaggcc ccgccaccct ccctgtattt 3180 
ctgaggggac tgacagatat tgctactcct gataatcatg agggaaagca aaacaaacag 3240 
aagcaataac accaatcaca aagcatgtcc acaagggctt ggggcttccg tgtgaccagc 3300 
atgtcagtgt cgtttgtgtc ccggaatgaa gcaagctgct gtgtttggaa gcatcttcgt 3360 
tggtttggtt tctagtctct ctcccctggt agggagatag ctccactagg gaaagttcgc 3420 
attgctggga gtttgtgctc ttttgtgagt gtgtgtttga ggggggatgt ggtgcaattg 3480 
atattttgtc gggcatacat gtgctgcagt acccatctct ttcagcctct ccagctagct 3540 
ggcggcttgg agtggccatg gagtaacttg ctttgtttcc tgacacctgt taagctacat 3600 
cctgaagtgt gtacgtatct gtgtgttagt gcctaacacg caaaacttcc tgccttcagg 3660 
ggagacactt cccttgtagc ccagcccgta gccttctact cctcaacctc ccatcctttc 3720 
tctttcattc attgcagagg aggagaaagg ggacagggaa gtgtttgggg tgtggtcatc 3780 
tggggggaag ggggagcatc ataatcaaga attttttttg ttttcattct ttaaaatggg 3840 
agcacttgta catgtggaaa tcactggctc ttacatgtgg cattgtttac atttgtgctt 3900 
atatactacc tatcctttgt ggcttggaag tgtttgtcta gtatatttgt gtatatagta 3960 
aatggtataa atgcatgagg atgtttcatg tatgtgctat gttttctcat gtatctctgc 4020 
atatttttaa aattatattt agccctaact tggttgttca actgataaag ccattgctac 4080 
tgttttcaaa aaaaaaagag attgaagcat taaattgtag aacaaaatgc tgaaaatatc 4140 
actgccatca ggtagtactt gattctattt gtgtgatctg taacaaaagg cttgcaatcc 4200 
atttattggt tcttaaaaac agacacccac tctacaagaa gttagaagtg cattatgttt 4260 
ggactgtgat tactgtcaga tttgagataa acttctatgt aaacatcatt atagttcagt 4320 
ctctagaaat ggctgatttt aattcacaga aatgtaattg atttacatgt tgacagtaaa 4380 
tacaaaggta agccatggac tcttgtttct cggaacaggc atcccatcag gcacctttac 4440 
tatacctttt actagaatag aatgtttgta tcctcttgag atgcctaaga tcccacatct 4500 
ttgaagcaca aggttccacc atgcaaacac acaatggcaa tcccttatgt aacaagtcat 4560 
tcctaggtaa ctgggatcat gtgtggtgaa agagtaagca aaacattgtt tgcagaaaag 4 620 
caagagtttt tagagaaaat gaataataaa ccttaggggc aatagaatag ttaaattgca 4 680 
tgcaggtctt gctaggccaa agactaaaaa ctgtccatgt aaacagttat agtagtggag 4740 
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aagcccacag ggtcccagag ctagatttca 
tcccgtaagg ctctctggtg agagatgagg 
gggtgagggg gcttatgtgt atgtttatat 
caagggagga agaagagaaa tggcacaagt 
aggggtacac aatcaaatgt ttgagcaaag 
aagctcatct gtaggagttt aagtttcagt 
ctaccactca aggctacagc tagtgagttt 
agtgagtcgt gaggaactaa aacctgtctc 
ctcatagggg gattatttcc aaaccagcag 
ttgtcgcaac tcacagagcc cccgaagctg 
gacagctttg agatgaccta tagatgtaag 
gcgctagacc aatgaaagga gggcctgtgg 
gttgccattc gtctttgtaa gagaaagccc 
gattccagca gttggcattt tagcccctgc 
ctgtcagcta actttaatgc cagcacgatg 
ttgcagaggt tttctggcac agctgagaga 
tagaattaag ggtcctgagt ttgtgaaaat 
ctacatgtta aatggacaga atgtagcttg 
atctggttct tcattttcgg aagtgaccaa 
aaaatagctt gaggaaattg gttgtttaag 
aaacagaaca aaggtggcct caagaagaag 
atgcattgta aaatgtgggc aacaaacacc 
aaaggaagcc tgtgctcaca tgattggctt 
caggaaacat attttccctt cagggattaa 
tgagctgaat tcacaagaac aatttgagac 
ttggtgtgtc actattgact tggtgacttc 
ccatatttcc tttgtgtctt tctcatagtc 
gctatgcttt gaagtctcaa aatgagaagt 
taaaataatg ttgttacaaa ttccaagaac 
acttactgtg catcagaact atgataaata 
ccatgtatta agcactgact acagcccatg 
tcatttaatc ctcacaggag ctttgcagag 
gaaaactgag gctacaacag attatataac 
ggttttgaga cctgaaggtc catgttcctg 
gtaccctaat caactagtgg gattacccaa 
aggcagaggt ctaataaagt acctaaggag 
tgatttgcac aggactggtc cattgaataa 
ctgaggttga aaatcttatc tactgtgtta 
taattgctga attttagttc tatgaaacta 
agctattttg ctattagata acctttcttt 
tggaaggggc atttggttgg gggcatcagt 
gtgctaattt atttcctgga gcagggctcc 
attacttgca tagagaatca agaataaata 
gaaatatcta cagcaggttt tggttgaaag 
gatgttgact ggcacagcaa gaaaataaaa 
aaaggtggtg cttctgtctt gttatctcag 
atgactgttt attagggaga cagaaatatg 
aggcctggag tggatgaatc tgagttcttg 
acaggtttcc atgaattcct gtcttgttcg 
gctataaact gaaatagata ttggaaatac 
agcttacagt ctagtgggaa acattgttat 
ttgtgcttag actttgtgtt tcttttcttt 
aagttaggtg gaaatggtgg cctggctgga 
ggaatacttt agggctgtgt tcactgtggt 
ccaattttaa ttgagtaaaa aatgtcagtg 
cattactgat acatggaaaa gcaagaaaag 
gttttttgaa atctgaatta gtagttgcct 



atcaccctgc agcactgatt agtacctact 4800 
ctgatgtaag aaaaattaga catgactaga 4860 
aagagaatag cagctcctag gtggtttgca 4 920 
ctcctgtttt ctctgacttt agctgagttg 4 980 
taatagatat tagaggccca tgtcacaaaa 5040 
gtccagttaa gcctatttga actctcatct 5100 
ggcatgggga gaagaaaagg actcatctgg 5160 
tagcccatat accatgttgc attttctttt 5220 
tcacaggaac atgctgcttt tagatttgct 5280 
ctcccccaga ttgatattgt gaacatcagc 5340 
taattgcttc tatttctgag atgtctttct 5400 
gtaggtggtg gtgttgcaaa aatgacttca 54 60 
tccaagaaaa atttctagag gcttctgagt 5520 
aatgcttcat gtaatggatt tgctttttca 5580 
ggtgcttggg ttgactgcat cttccctacc 5640 
gttttacaca tacgaatctt ctgagaggca 5700 
gtggtttaaa gtgcttaagg catttcctat 5760 
ataagaaata atagctgaga agttgtacac 5820 
taattaaaac ttggtgcctt ggtcatttta 5880 
gcacagtttg taggatggtc attgagcatc 5940 
ggatagataa atctctgaca tctgaattga 6000 
attctgagtt ttcctggcaa ctaggaatga 6060 
ttcatgacag atgaaagcat aaaaattaat 6120 
gctcaagtaa aaatttatgg cagggaaatg 6180 
cccaaactgg attgaattgg cacgatagtc 6240 
tgctgagctg tctctggtgg aagtcatgga 6300 
ccactggtga gttctcaatg atttgaatag 6360 
cagaattctt tctatcccaa agaccactcc 6420 
tataattata gcagatttca tttattgaga 6480 
ctttatagtc atttatagtc acattcatta 6540 
taatgcacta attgcttttt atgtgttagc 6600 
tactagacat tatcttcctt ttagagatga 6660 
ttgcccaagg ttactcagga agcaaatggt 6720 
gatactgtga cagatacact gattcttatg 6780 
gaaatgaaag ggaattttca gggtcttgta 6840 
aaaaacttac agtgtttcaa actgtagcta 6900 
ataaaatagc aattcttcat tttggtaatg 6960 
ctcactaccc ttaaaaaaat agctgaagtt 7020 
tctttctggg aatctagaat aatactgagt 7080 
actttttaat gtggctttga ttagggtctt 7140 
gtacctataa atgtttaaac gcatatacat 7200 
atagcttttg tcatacttta tagcttttgt 7260 
gctggtgtgt gtattcactc atgctgctga 7320 
agatgtgaac tgctctccta gtagtgtcaa 7380 
gacaaaaata ggagaaaagg ttctaaggca 7440 
caacctaaga cttgatctct aatagggtta 7500 
atacagacaa tgttgttcat tttggtcaac 7560 
aacttgatgc tgccatccat gaactagtgt 7 620 
ttcagttaat tctttaagtg tttcttcaaa 7680 
agagatgata aaatgccctt gctctcaggt 7740 
ggggattatc tgagaagcaa taatcttatt 7800 
agctcccaga tcccaaaggg ggtactgggc 7860 
ggagcctctc ttaaatttca tgtaaggcta 7920 
ccattggttt gggaaaactt tacatctgtc 7980 
ggagcaaagc agttttagga agaatgtact 8040 
catcttattc agactaattt tctgtgctgt 8100 
taaattaact gcccaataag tcttagatct 8160 
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agaaaatgac caaattttat ttgttttttt 
ccctttgcat tttgctattc attgtccctc 
aagtgtaatg ggacctgttc atgctagaca 
aggtggttat atggtaggga aggtggggaa 
gccaaggtga gagtttggtg ccttcttcgt 
attgttaaag agtaaagcca ttgaatcagc 
cttttccaag cttacagggg tagagtttgg 
atcagctacc cagcttattt tggtgccttt 
atgctgattg agattattac tattagaggt 
ccttttaatt tattcaccta tttattcatt 
tacttcatgc caaacactgg gccagacgct 
gcctctttcc ttgtggagct aacagtttag 
aaattcactc agggagcttg tttaaagcaa 
caagatattc aagagtaggc cctggaggaa 
ttaagaatct ctttaatcta aaaagtcttc 
aaccaaacct aggcatgaca taaagtgcta 
caccctggag tactttatcc atagtaagac 
tgtaatgttg taagtgctat agcagcaatt 
tggagttttt ctctaggagg tggtgattga 
atctgcagaa ggggtcacca aatatagcca 
gttctgggcc catacactgt gtaagaacca 
gtcttgacta atatttttaa atattaaaat 
agcctcaatt tgctgagatg ggacagagta 
agcccattct tcccattttc tttacctgct 
tgattgatag cactgcaatg ttgtctgtgt 
aatataaact gctgtcagaa agaccatgta 
aaatgcctct aatacaaatg aataaatgtc 
aatgttcaga gcctcaatga caataggtga 
cctctgtcat ggacctactg gggaagccca 
caacagagag atggccttag ctcttgcctg 
acagatttgt ttttcttgac ctacacaatg 
ttctggcatc tttaaaaaaa aaacttaggg 
ttaacttcta ttttctagag ctgagaagtg 
ttaaccaaag tttccattac ttcctgtttg 
ggcaggcctg taagtaggcc aatctactca 
tttattattt tgttctaata atatctatta 
aaagatgtat aactcctact ctcaaaaaaa 
tggctgatgg tgggataaac aagaattatt 
caggtgttgt cttaagtttt ataactgtct 
atggtactga gactgagaga gattgtgtct 
agctagtatt taaagccagt ttggggggct 
gatctatgaa tatggccagg ggcagtggct 
cgaggtgggc agatcatgag gtcaggagat 
ccatctctac taaaaataca aaaacttagc 
gctactcagg aggctgaggc aggagaattg 
gccaagatca cgccactgca ctccagcctg 
aaaaaaaaaa aaaaaaaaaa cctatgaata 
aatagcccaa cggtagaagt gtcacagtgg 
ttccaaggca gaagctatga ctgtggatga 
gcaggcatta aagggtggag cagattttgg 
ggtaaagtca caatattcct cctcgcctcc 
atgtggtgga gatttgggag tttgcctcct 
gatgtctaca tttctctggt taatgttaca 
tgcccttaca ttaaggcaca caagtgttga 
ttataatatt gataatcatc agcttttatg 
cctgactagt ttgaagatcc tgctgggatg 
caggataaag tgatcatata gcacatttgg 



ttcataattt ttgctgtcat ttgatttttt 8220 
caccagaggg tggagaaggg aagggagagg 8280 
tgttagcagc aggggactaa aaatgaggag 8340 
gatagaagag gactatggct actactactg 8400 
gttcaatctt tcttgcttcc ctggtctgcc 8460 
aacccaatgc cttgggatct cctttaagca 8520 
ggagattgtg catagagaag atatttactg 8580 
gtggtaagtg taaataaagg aataaatgaa 8640 
ctgagtagca tagttgcagc ttaattctta 8700 
cattcattca gcaagtatat actgaatgcc 87 60 
aggaatatag gggccaggaa agaaggcaca 8820 
cccagaggtt cacaaacttc agtgcactta 8880 
gactcatgat gaacttttta cttcttgaat 8940 
ttctatagca attattatgg ggattacact 9000 
catcccttta gtaagcattt ttgagccctc 9060 
caaatgcaga caacagtaag ttgcattccc 9120 
attttctatc ctgtaaagag aaaattaaaa 9180 
tgtatggagt attgaagatt ccttgaagag 9240 
attgagcctc aacgataaat agggttatct 9300 
tccacaagcc agttccagca cagagactga 9360 
tggaatttca cattagaact tggacatctt 9420 
atctggcaac actagggccc aggtctatgc 9480 
aaccccatct acatggcctc tttcccaatc 9540 
ggctttggca agcattgagt ttgcaagccc 9600 
tgtgtcttta taggttagct tcaccaccag 9660 
tatctatagt ttagtgttga gaacacttgg 9720 
tgtaggagtc agggaattac acagcataaa 9780 
aaggtagcaa ccctaactac agtgggatgc 9840 
ggggcttcct aacaagacat taggcactgg 9900 
aacatggctc atcaacttca gagttgcaac 9960 
ctttttcata tttggcataa aaatccagat 10020 
cattaagcaa ccctaggccc acattcctgc 10080 
ctgccctttt taaatacata aactcccaag 10140 
attgctagag taatcaaatt atgttgtttg 10200 
tacctgattt atatcgtcag cattcatcat 10260 
cttaccctgt caaaaataat aaaagacaca 10320 
gtagtgtgta tacagtaatg aagagaatgg 10380 
ttgaagacct actatgtgcc aaatactttg 10440 
tgaaaggtga ttatccccat ttcattgatg 10500 
tctttatatt atgcaacaaa taaaaaccta 10560 
ccaaaactca atgagcattc tccttttaaa 10620 
cacacttgta atcccagcac tttggaaggc 10680 
tgacaccatc ctggctaata cagtgaaacc 1074 0 
tgggcgtggt ggcacgcacc tgtagtccca 10800 
cttgaactca ggaggcagag gttgcattga 10860 
gcaacagagc gagaatttgt ctcaaaaaaa 10920 
tatggacaga gaaaagtacc atagcagtga 10980 
aatttatgat tcaagcttaa gagctaagtc 11040 
tgaaaacttc ttggggaagg tggaacttta 11100 
ttgtggagtg ggaggagggt attttgagtg 11160 
cctcaaccac tgctgcatat gcttgttacc 11220 
gcaccacact atgttctcag ctccatggca 11280 
gtctgaagtt ggctcctcaa tcttcacccg 11340 
catttgtctg tggttgactc ttggctctta 11400 
gatagaaaaa tccttgtaaa catttctttc 114 60 
agtgagctta ctgctctcct ctcagccact 11520 
aaaactggta agcttcatgt atttgttcat 11580 
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tcattcattc atccgaaacc cttttattat caccttctat ggggatatgt actgtcactg 11640 
ccccactcca tacctttgta gacagcatta gttgattaca gaattttttc ccactgaatc 11700 
tacacgtggc ctcaaattgg tgacagctga tgcaagagat gaaatccatt tttcattaca 11760 
aggcattgca ctagaaattg agaaaaataa aatgcattag gcattgattt tcctcttaag 11820 
gggctcaggt tcttataaag gaaataggca taaaccaaat ctgcctctct ttctctctct 11880 
ctctctctct ctctctcccc ctcttacata cacacacaca cacacacaca cacacacaca 11940 
cacacacaca cattcagagg aaggaaggaa aacctttagt tggagggatt taggtttcat 12000 
aaaacaagga catttgagtt gaatttggaa agataggtag gatgtagatg ggaaaatata 12060 
tatgtatata tttgtataca cacacacaca ccccacacac cttcctgtaa tggcaggttt 12120 
gatgcagtaa atagcatact gagttgagat ttgaagacat ggattcaaat gctggactac 12180 
cttcttgaat cgtttaggta taaccttgga gaaacatccc ctcccccaag cttatttata 12240 
aaatgaagct catggtctct gagagtcctt ctggggctgt atttctatgc tggcagcata 12300 
tggaaaacac cagagatata tgattaatag aagttctcta cctttggcta gcaagaggat 12360 
ccctaaacaa gctaagtgtc taaggaaagc tctagacttc ccttccacca acttgcacat 12420 
tcagggcgca ttctgtatgc aaatgctgaa gcattatgcc tactttgggg cctcagatga 12480 
attcattcaa tatttaacaa atatttatgg agtgcttact attgtgaaca aaactagaaa 12540 
gcagccctgt gctcatggag ctcaccagct aataaaggag gctgccatta gatgatcaca 12600 
tgaatgagtg tataggttgt aataaacaat ctgaaagaaa gggccatgat tctatgaaag 12660 
catatacaaa ggaaccagct cgtgcaggag gatgaatagg tgcttgagct gaggactgaa 12720 
ggctttggtg atgcagtgat gggaacagga ggagtgtttg agaagcttcc tgggcagagg 12780 
cctggccagg ccttgtaggc tctgttatgg attttgatct ttagctcaaa cactaaagaa 12840 
cccatggaag ggttttaagc agagaagtgc cgtgtttaga tttctgtttt ctcatcagaa 12900 
atcagatgct tgatagtact ttgccaaatt cacatttgct gaagacgtct ggcttgtgtt 12960 
tcaatggcct tgtttcttta cttccatctc cagtttggga tggagtctgg cagatgggca 13020 
ggaggatggg gacagaggcc aggggaaagg gaagctgatg acttggttta ggggtgcgtt 13080 
gtcatggtat gtggctctct gtgtgcccat gggcattcct ggagattgga ggctgatcca 13140 
gtgcagtgcc tggagaactg gtgaggctgt gtggaggcag ggattcattc agagtcctca 13200 
tttattttaa caaggaagac atacatagca ctgaagctca gcctctgttc caatcagaca 13260 
actcagtggt cttcaaattc acactgccat ttccttcact tacacttaca aactgctcct 13320 
tccatactgc tctgctgaaa ttctgtcctt ctggtcagcc ctagctctta ttttaccttc 13380 
ttcactgagc cttttttctc tttcatattc acatgctctt ctgcccttca gctgttttta 13440 
ccatctccct agcaatggtc acgcttatcc ccaggtttta ctcaccttgt cacttgaaca 13500 
cgtgtcttac ctgcccaact ggactaagca atgtgaggtc aatgaccttg tttactttag 13560 
cttcccatca cagtctccac cacagtatag tgttcagtgg tgctagatga atgaaagaat 13620 
agatggcagt gacaagaata agcctgaatc attgctgctt agggtaatgc ttttgatatt 13680 
atgagttttg agagttccaa caggctgtgc tgtaaatacc tatggatggt tattactgtg 13740 
tagctccaaa gttatggggg tgaaaatcag gtgtaagtcc aattgaaagc tcattttagc 13800 
tcccgctgga ctattgtcat gaggggtcct aaataaggta taaggttcca tcttctaaat 13860 
aaagaatttc tgattcaagc cctgggtttg ccattgatgt atgagccttg tatgcatctt 13920 
acgactttgg tgtgtcccag cctgctgtcc agaaagcaca ggggagccca gggccatggg 13980 
aggtccaggt tgaggaggca ggcaaaactg gggcaaatga ggcagttgga ggctaatgaa 14040 
attgataccc tatggttagg cgtgtttcca tgtgttcatt gtttggaacc tctaggatga 14100 
aaataaagtt tgagaatagc ctttagtttg tctacaaaag taatatatac attacagaaa 14160 
gctataggaa cagagacagc aaacacaata aaacttgtat aattatagta ctaaggatac 14220 
attttccatc cttttttgaa cctaacaatt taaacattaa tagatttatc ttactgtctt 14280 
ataacatgat tttttgcata ttacaacctt taggctactt ttattattgc acattcattt 14340 
ttaatggttt cataaaagcc cactgtacat atgtacccta atttatttaa gctgtctcct 14400 
tttatgggat atctagttta tttccctttt tttcttaaaa acagtgctgt gatatttctc 14460 
tcaataaaat tttaatcatt tgtacatatt ttgttgtgat tttaagggaa ctataaaccc 14520 
agtagaacaa tttccaatga aattgaataa aatttctagt atgtgcagaa cttgagataa 14580 
cttgtttaat gtgagttctg acattgactt gctgtgtaac cttgaaaagc ttttatatcc 14640 
agtttagttt ccagatgtat aatatggggt gctaatacct gattctcaaa cttattagaa 14700 
caggcaagtg ccattatttg tatgaaagag ctttgaaaaa tcaaacatgc taggcaaacg 14760 
cagggtaata taattaacca ggcttcttat ctattagcat aaatgcacat aagtaaaatg 14820 
taactaatta atttttagtg ttttcctagg aaggtgttca ttacccataa atgaatctat 14880 
tttatggata gggaaaattg aaatcttaag gctaagttta tccaagatct aaatatatag 14 940 
ctatgctaag tctgaaagca tttcttttca tttatgattc aatccatgaa aatggacgaa 15000 
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atatgtggat cataagagtt tctttggctt aagaaattat tgtttataga aataaggggg 15060 
aaagcccatt tgagcagtaa gcagtcagga ctgtttgtgg caagtgacag aaacccaact 15120 
aactggatag ttttaacttc aggtatagcc aaattcaagg gcttgattga tgtcatcagg 15180 
actgagacat gccctctaga tttttatgct tagcttttct ctaaactggc tttactctca 15240 
ggcatagata tgtcatatat cagatcccag cactggcagc cttcacctgc ccagctacaa 15300 
gtccagtgga aaggaaactt ctcttttctc atcagttcca caaaagtcct gggcctgatg 15360 
tcattggata aaacttagga cacatgctca tttctgaata tgacacagtg gccaggatat 15420 
tgtagtcttt tgcttgacat gggcctagtg aaatcatcat ttttagagct tggggtagtg 15480 
agaagaacca aaagattgtg gatttaagcc agcccaccaa aaccatacgg acatggggga 15540 
ggaataattc cctcaaaggg aatcaggatg ctattaccaa aagaatggag attgaatgct 15600 
tgagaggcaa aatagatata tatatatatt tttgtctatt aagcatatat atatatacac 15660 
tatctatata tagatgtaaa tgtatttatt gtctatttta tgtatttatt gtaaataaat 15720 
acatttactc tctctatata tatattctat ctatatatag atgtccccca ttgcacatct 15780 
tttcagaaaa gaaaattcag acaggctgga gctgaaccaa tgcagacaaa aagtcaacat 15840 
gctaaagtca gaagagccac caatgagaca agcacataca agggattctg gtgggtaagt 15900 
caagatcaga agtcaggaca ggcaaaaggt ctgaaattca ggcttctgag ttaatcaaag 15960 
ctgtaggggt ctttggtaat ccttgtccag ccacgagcaa acctgtgttc acagaacagt 16020 
tagaaagtgc tagccattca cccacataga agcgactggg aggtggtctc tccaggccac 16080 
tctgataata agttgaggtt gtcggctgta aaatgtttgg tgatcctgag gcaccaatac 16140 
tctgtcccag tgaccaggac atttcctgct ctactcaaac actcaagcta ttgtgcacaa 16200 
actctttaac agtcccagtt ggaatcttgg ctgcaaatct gatgagccca caccaccaga 162 60 
tggtaggttc tcctaattgg ttccatcccc tctagttttt ggcaggaacg aaaccttccc 16320 
tatacctgaa gaggggacag ggagagccag gatcaaatga aggaggcttt aacctctgtt 16380 
aaacaagcag agatctcaag ctggctcctg tccaaggcaa acagggcagt atggttcttt 16440 
tacataggtc atggatttac aggggactga tatcctaaga ctaacactga tgaggcacaa 16500 
gactgtaatg attgaaaagt ggtttagcta caaaggtcaa ctgatttggg ggaaggtgca 16560 
cttgtttatt caataattga tatttgtgga gcatcgtggt agatgagatt attatttggc 16620 
aattatttct ctctccctca aactccttgg gaaaaaaata taacttcctg cttcattaat 16680 
gtcacatttg gtcatttgac atgctctggc aacttagata ttagtagata tgtttgaagc 16740 
agagattatg agaagaacat gccccagaga gcctgctcgt ctcataggaa aggtaaatac 16800 
agtaagtaga agtgtcccag tcaacccaca gcccacagac taaagtttga agcagagcta 16860 
tcccagctga cctgaaaatc tgtgagcatt gtaggaaatg ctcaatgtta tataccactg 16920 
aaattggaaa tggtttgttt tgcagcaata gctggccaat agaagcaacc agcgtgtgtt 16980 
ccacactttg ctaggcactg agggaacatt ggaaatgaca gacatagtcc tggtctccat 17040 
ggagcactat tgtagctagg aagatagaca atgaatacat ttacaaataa aatatatata 17100 
gagagatcat tactagggtc ataaagaaaa taggatgtag tgacatcacc actatggtag 17160 
agaagaacac ctttaaagta agtggtcaaa gaaggcattt caaaggaggt gacttctgag 17220 
ctcagatata aaagttggaa gagagccaaa tgagagtaaa ttcccctgtg gaatgtcgaa 17280 
atccagcccc atgggcaact tggattgaga atcttggtct aggcagagga gaatgaggtg 17340 
ccctggaaaa atctcctgca aactaggtca ctcacttgtc ttagggcaga tactagcttc 17400 
tctctaccac atcacatgtg caagtcaagg gctggcagat cacttgctgc caagactaga 174 60 
tggtctcagg cattggaagc cactgagcat agcaaggcag agagggggag gtatgctagg 17520 
gcattcctaa ccatgaggtg atggaattgg ctatacttct tggcttggct ccagaatact 17580 
caagagttgt tgcttattag gatggcaaga aggggtgtgt gtgtgtgtgt gtgtgtgtgt 17 640 
gtgtgtgtgt gtgtgttcat acatatattc attctttcac agaggaggaa aatgaactag 17700 
atggcttcta gatccctttc caagtgtgtg attataaaaa ggatgctgag atttaaatta 17760 
tgtgtgaagg aaatgttaaa ttgaagaact tcatcttagc tctgtgccaa gctccatttg 17820 
ttaattgagg tggttaatta gaaaggagtc tcccttgagc aagcaggata agctccagca 17880 
tccctaatct atctggtaaa ggctatagag gtgtcacacc tacaagggag aacaaagaac 17940 
atctaatgca agccctctgt ttaatcttca ctccagacac tgaaaagcaa gaatgttggg 18000 
ggacaggaag ctcacagcgc ccgcatggaa tttagactcc aagtagttta ttagctttgg 18060 
gaaaattatt taacttacac tgcacttact agctcttctt ccttactcca agatcactct 18120 
gaatacttta aaatatggtt ggacatggtg gctcacacct gtaatcccag cactttggga 18180 
ggccaaggcg ggcagatcac ttgaggccag gagttcgaga tcagtcgggc caacatggca 18240 
aaaccctgcc tctactaaaa aatacaaaaa atagctggca tggtggcaca cacctggaat 18300 
ctcagctact tgggaggctg aggcacgaga attacttgaa tatgggaggt ggaggttgca 18360 
gtgagccaag atcaagccac tgcactccag cctgggagaa agagtgagac tccttctcaa 18420 
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aaaataataa taataaataa ataaaagatg gtttctatgt tgatgtaatg aaccaagtat 18480 
tctagggtta tggaagaatt tctcatcagc aaatccagct acttatgtct ttgtttcttg 18540 
cagcctaggg ctgatggaag tcttcacacc ccttccaaga acccccacaa acattccacc 18600 
acccccacat acccctgcaa gaatgaagct ttctggggct atgttctcct caacaccacc 18660 
acaggggtcc tctctcactc cttctgctcc ttttttctca ctagcgtcta ccctttcccc 18720 
aggattccag cctatacatt ctgagttcaa catatcaaaa ccagaacttg gcatccttcc 18780 
tgccaaactc attgctcttt tctcacttta tttctgttca atcattcagc cttaaaatgt 18840 
gggtgtctgg tttcaacttc tccatataat caatattatg tgttgatttc accaccacta 18900 
tgactcgagt ctatccccat tttcttcttt tcccactgga ttatggcttt tgccttctga 18960 
ctgtcatctt catgatataa tccacttaca ggtggcggat caatttattg aagactattt 19020 
ctgatcatca agcttcttgt ctcaaaaaaa tatcagtggc ttcccatctt attttaagtt 19080 
caaactcaga ctaaaatatg agtctctcca taatgggtct ctgcctgcct tccttttgag 19140 
acttacctaa accctacact ctatgttcca gccaaactag actacctagt gttgccacaa 19200 
acatgcctct cagtctttca ctgctttggt catactactg attccatctg aaatagtctc 19260 
taccacattg agcacccact ctctactaaa ccctggagat accgcagaaa acaaaaccag 19320 
caaactccct gtacccttgg ggtttacgtc ctgggaaaca aatacatata atgatttcaa 19380 
atactagtta aatgatacaa acatagccaa atagagtcat gggacagaga gtgagctgta 19440 
gtgctacatt agatgggtgg tcagggagga caggcatcac ctgaagagat gatattttaa 19500 
ctgagccttg ttgcaatcca tagaactttc tgcagtgttg ggagacattc tatatctgtg 19560 
ctgtctaata tgtaggcact agcaacatct gcccctatta cacattagaa atgtagtgag 19620 
catgactgag gaaccaaatc tttcatgtaa tttaattttt attaattgta atttaaaaag 19680 
tcacatgtgg ctagtgactg tcatattaga cagtgctaat ctagattcta gaatatttca 19740 
gacaagaaaa aaaaaacagt aagcataaag gctttaaatg agagtgagtg tatattcaaa 19800 
gaacagaaag aaggccagtg gagctaaagc atgctgagga aagagacagg gtattttgta 198 60 
agtcatgata aggaattgac atggagttta aatgcaataa aatcactaat ggattttaag 19920 
cttgggagtg tagtgatctg atttaatttt caaaagatta ctgtggctcc tctgtggagg 19980 
aaggagaagt gataaattga catgaaaaca gtatgtgtca gtcagttttt gctgcataac 2004 0 
aaataacatc aaaatctcag tggtttataa caaaagcttt tttaaaaaac aatcacatct 20100 
gcaggttgca ggctgcagtc tgtaggtcaa ccatgattct gattacactg taggataggt 20160 
taggctaaac tccaggaggg ttcaggtctg cccgatgtgt ctttgtttgg gtattgaggg 20220 
tcaaggggca gcagctacaa aaagacctgc tctccttttc caggagcaca agaggctaag 20280 
ctttatcatg caaatatatt taaagcttct gctcatgtca tgttctctaa tattccattg 20340 
gttcaagcaa ggtatgcagc taaacctaat gctcatgggg cagaaaacac actctgtctt 20400 
ctaaaaggaa ttacaaagtc atatgacaaa aggcatagat gtgtaatttt ataacaatat 204 60 
ccaatgtaac acagagggga ctggggaggt tcctacaata gtctggatga cagatggtgg 20520 
tggcttggaa aaggtgttag tagaagagat gaaagaggta agattcataa agtattttga 20580 
agaaagtatg cacaggattt agtgagagaa tgaaagtaaa gtatggggta aagaaaacag 20640 
accattttcc aaagtgcttc tcagatgcta ccacagccct gtgaagctgc ttctccctcc 20700 
ctcaattgca gctgatttcc ttttctctta tctcactggc tctctttttt gtgctgcttt 20760 
tattggccca tttcaacttt ctaccttctg ttcttgagag gttttatcta ttgttaagac 20820 
tctgaggttt tgagggcaag agaataaaat tatttgtatt tgcaacccaa ctgcctaggg 20880 
aataggctca ataaatattt ccagggcaat acaatggtta agagcaagga ctttggagat 20940 
atcatcctca ttttggatct tgtgtgatct ttataattga cttaacctct caaagccttg 21000 
gtttttattt ataaagtggc agtaatcata cactcctttt acagttgtgc agattgaagg 21060 
agatcaactt ggaaagtgct tagcataagg cctggcatag agtaagtacc cgataaatga 21120 
gagctattat tgtttgttca atttaattgt ttgtgatgac aagcttcttt ggaaatgccc 21180 
tccagagtct tcatcagaaa attcaatgtt cttgttgaaa tgaggggcac agagaataaa 21240 
aacactctgg atgggatttt gtaatttagt aaggagatct ctagatctaa agtctctttg 21300 
atctgaaaat gacagcaaca acaaaacccc aaaatagagc cactttgcta cttgtttcct 21360 
tttaccatct gccaagatgg ttctttttag ttctcgaggt cttgcttgga gtcagctgcc 21420 
ttaaaataat aaaataaaat ttaaatctgg tcctgcactt atttgggagc gagaagacaa 21480 
actgctttaa ctaaatatat agctagttgc tgctggagtg aaatataggg caatgttgac 21540 
taaagtgttt gaggaatgaa ctgctctggg aaagtgaatt ttaggataaa actgaggatt 21600 
aatcagaacc tgactttttt tttggtaaca tttgtaattg agacagtttc cgacatgtac 21660 
ttttcaagct ttcttgcctt agtgagatga ctttgttgat cctagtttaa agttgccttg 21720 
atcataagta tcctgtaaat ccagcgggca ttgtgaacag agattgtctg cgattgcaga 21780 
tgagaacaga aaggttgggg gtgaaatcgc tgaaaaagag tgcatgaaag aggtgaaaag 21840 
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ttaacatgat gagtttggat atctgcagga gggtttctcc ggagctctgc aagatacatt 21900 
tggaaaaata ggctgggttc agactagaga aggctttgaa ggtcaggaat aagtatttga 21960 
gagacaaaat aatatgttta agttgcattg ttgatattcc aaaggtcttg tttataagca 22020 
aataaaggat gattttattt agcaagtact agaataatcc tcagcaaaca taaagttacc 22080 
tgacatcaaa tgatgcagaa tatccacatt tctacacaat gaactctaca gggttcattt 22140 
ttaaagtgtt ttaggagctc atttagtcag tgcttcccat tagaactgtg atgttctgag 22200 
gatattaata ggttatttgg ggagcattgg gggtgatgag ggaaggttgc atggttataa 22260 
atttagtcat gctgattaag caaagtataa ggggtgctgt gttttaggat tttgaagacc 22320 
ctttaaaatg ctaatgtgtc ttgttaattt ctaaaaaaga ggatacagta tgtattttcc 22380 
aaacttagtc cctctgtagc ccacagaaca ggcgggtctg attgcttggt gagtgacaga 22440 
tcaatgacca caaccaagga gtattttaaa aagggatttt attacttgca acaagtaagg 22500 
atggctgaca ctgagcatag ttcccaaagc agtgcctccc caaataaagg tgaacccagg 22560 
gcttttattg acctggttgg ctgaatcatt gtatgtagag gtgaagtaaa ggcagtgcag 22620 
gtgcaggtgc agtcacagat catgctttta tacttgtggc atgtatagga aatgttgaat 22680 
aagctcatct ctggatgggg attttagtac gttaataaag ggagttcacc aaagttcatc 22740 
tccaactcag gcatctcaac cagtttttgt tttttgggct tcttcctgga attttttttg 22800 
aaacaagaac tcaaggtgca acagttacaa gtgtgtaatt tctcattgtg tgtacccaac 22860 
aatctgggga ccctgagtta taactctttt tgctgaattt gtgaataatg tcttgtgatg 22920 
tttaatcctg ttttacatat ggtgaaacta aggctgagag catacacagg gatttatctg 22980 
cagtcaggta gcagccaggt cacgatgcta acacagaact tttgaccgtc acccaaggcc 23040 
tcctactctt ccaggctccc tgtcaatttt tggagccagt cctctaaaca gttttaccca 23100 
gaacaaggaa ttgccggtga tttcagaaag aactggcaga ctcgggggag gcacttgcca 23160 
aattggctca ggctggcctt gctgccatgc ttcttttcct cgttcctgca aagccagttc 23220 
taggtaacct acctgcagat gtctaccctg aaataacacc cagcctctgt tacactgctg 23280 
ttacacacaa acaaacaaag caaatctgct gcaaaaagag gcaggttgaa ataaattatt 23340 
tttagtgcaa catataacgt caatgtcaaa aatacgaaag aggcaggttg catagggagg 23400 
ctgaggagga gatgctgaga ttcagtgtgg aaggtgacaa gataagagga ggaaagtttg 234 60 
ccctggccct gaggcagcaa aattaatttt gccccaggcc tttaactgag ggaaccaaat 23520 
ctatttatta cagaaattat aaagagattt ggtgactgac cccactttac cttgagtgat 23580 
tggactttac acgcaggtaa taggaataat agtgtcatat ttatagagca tgttcaggtg 23640 
gttgaggata tggaaagcct ttccagtaag gaatggttgg aggctctgag aaggaaaaac 23700 
ttaggcttgg atttcatcat caaacttcta aacaaacata tttggaaagg tactgcattt 237 60 
tctatggccc aagaggccaa aacacagaca aagtgagtac aacatttggt ctggcagatt 23820 
ttgactccac acgacagagc ccattctaag tgtgtggact ggctagcagt ggcgagggtt 23880 
ccctttctct agtctgcaaa cctcccaatc aacatggctg ttagaaacct ggtccttttg 23940 
cagaattgct ttaccacttc taccttgatg acccagtgcc tgtgatacca ccgttagaaa 24000 
gtcatctaat tgcgtttaaa cttgttctct ccagcatccg ttgaatagca aagacaaatc 24060 
cttctgacca tagttctaat tctgtgggct ggcactatgg atttaattgt gccctcccaa 24120 
aatccgtttg ttaaagccct aacctccagc atgatggtag tgggtttagc tgtcgtcatg 24180 
aggatgtcat cctcataatg gattaatact ctataagaga caccagaata cttgctcttt 24240 
ctctttttct gtgcacatgc accaaagtaa ggtcatgaga acacacagca agatggcggc 24300 
cattggcaag tcaggaagag aacctcacca tagtggcacc atgatctcgc acttccagtt 24360 
tccagaactg tgaaaaaaga aatttttgtt gtttaagtca ccattccata gtatttggtt 24420 
atggcatcct gagctgacta acaccactgg taattaaggt gtttcttcca tgaaaaatga 24480 
aatgatggtc aataggaagc aaacagaagc ttaagcagaa gggcagaaag gagtgcattt 24540 
gggaaccaga gggagtccta tagagactag acctgggatt ttgacagttc tgctgagggc 24 600 
tggccctgtg cccagcctcc ttccacttct cttcatctca ctagaccaag aaaatatgca 24660 
catctgttat ccacaaatac tctgaattcc cccgccaacg cagatccccg ggaaaaccta 24720 
acacaagcag aacagtggag gagatgcacg gggtatttgg tgtttctcag gaaaaaggct 24780 
tgattttttg acaccctcta tcttggtctc ttaatttgtc ctgagtcagt gagccacagt 24840 
gaagtccaga gaaaaaaatg gaaataggag aatcgggctt aaacctcagt tttgtcgctg 24 900 
attaacctgt gatcttgggc aacccactga tctgttcttt gtttcagttt ccctccatga 24960 
ccctgagtac tttaatagcc tctattctcc catgtatctc tgttgaacag atcataaaga 25020 
gtagtgatta tgaatccaga ctttactgtt actgtcaaat atcacattaa tttcctcagc 25080 
cactttacac ctgtgcaacc tgaggcaaaa ttttgtaata tctctgtttc agtttttata 25140 
gctgtcaaat ggggttaata acaatcccta tcacaagcag tctttacagg catggaatga 25200 
aaaattgctt gtgaaatgct gagtgtaagg cctattatat tcagtaacta ttacctctat 25260 
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tttggttcaa caccattttg cagtctcccc caaacaccct gctgcaacac acctctaagc 25320 
cttggctatg ctggcctaaa aaacttctag acacccttga tgacccaaat gaaaatcttc 25380 
tgaacccaag gaatgctttc ctggctgtgt aggtggcaag catcacactc tcaagggagt 25440 
aaaactgtgc ctttctgcaa ccattattgc ccataaccca tcatgttgtg atgcgttttt 25500 
tacaagattg tcttttgttg tctatgcttt tggggttata tgcaagaaat cattatccag 25560 
accaatgtca catggttttt cccctatgtt tttctccagt cgttttacaa tttcaggact 25620 
tacatttaag tatttaattc attttgagtt gattcttgtg taagggatga gaaaacggtc 25680 
caattcaatt cttctgcatg tgaatagcca gcttttccgg caccatttat tgaagagact 25740 
atccttttcc ccttgtgtgc tcttggcacc tttgtagaaa atcagttgat catagacttg 25800 
taggtttatt tctcagctgt ctatcctatt ccattggtct acatatctgc ttttatgcta 25860 
gtaccatgct gttttgatta gtatagcttt gcaatatatt ttgaaatcag atcatgtgat 25920 
gcctgcagct ttgtttttgc ttaaaattgc tttggctgtt tgggattttt ttgtagttcc 25980 
atacaaattt tgggattttt ttctatttct gtgaagaatg atgttggaat tttgacagag 26040 
attgcattta acaagaaaat taataaccct attaaaaatg ggcaaaagac ttgaatggac 26100 
attcctcaaa agaaagcata caataatgga caacagatat ataaaaaaat gttccacatc 26160 
attaatcatc aggaaaatgc aaattaaaac cacagcataa tatcacctca gacctcttag 26220 
aatggtatta tcaaaaagat gaaaaaaagc aaatgctggt aaggatgtga aaagaaaagg 26280 
aaacccttgt atactgttgg aaatcatgta aattattgcc ctcagtttac aaaacaatat 26340 
gaaggtttct caaaaaatta aaagcagaat tatcatgatt caaccaattg aatcatggta 26400 
aacatctaaa gttctgggta tatatctaaa gaaattgaaa taaatatgtc aaagaaatat 26460 
ctgcactctc atgttcattg cagcattatt tacaatagtt aagatatgga aacaacctaa 26520 
atgcccatca acagatgcta ggataaaaaa atgtggtatt gctacataat gaaatactac 26580 
tgagacttta aaaaaagaag aaaattctgt tatctgtgac aacatgaata taactagagg 26640 
acatcatgct aagttaaata atccaaccac agaatgacaa atattgcatg atctcattca 26700 
tatgtgaaat ctaaaagcat caaactcgta gaagtaaaga gtagaatggt ggttgcagag 26760 
gctggggact tggggcatat gttggtcaaa gggcacaagg tttcagttag acagaaggaa 26820 
taagttctgg tgagatattg cacaacaaag tgactgtagt taataatata ttatgtattt 26880 
caaaatttct aaaagactgg attttaaatg ttctcaccac aaagaaatga caagtatgtg 26940 
aggttatgaa tacgttaatt agcctgattt gttcatgcca caatgtatat gtctatcaaa 27000 
agatcacact gtaacacaga aatagatatt atttgtcaat taaaaacaaa ttttttaaaa 27060 
aggctatttt tcttgctgaa tcccacatgt ctctggggct agggtagatg tcttttttaa 27120 
ccagattgtg tccagcttag tccggtgttc aataagcatt aagtgtctga agggtggtca 27180 
atggacaaac agatgaaggg ttgcatagat taatagaagt ctaccatatc taggctggga 27240 
gtggtggctc acacctgtaa tcccagcact ttgggaggcc aaggtgggcg gatcatgagg 27300 
tcaggagatt gagaccatcc tggctaacac ggtgaaaccc tgtctctact aaaaatacaa 27360 
aaaatcagcc tgacgtggtg gcgggcgcct gtaatcccag ctgctcagaa ggctaaggca 27420 
ggagaatcac ttgaacctgg gaggcagaga tttcagtgag ccaagatcac accactgcac 27480 
tccagcctgg gcaaccgagt gagactctgt ctcaaaaaaa taaaaataag aaatctacca 27540 
tatctacttc tcaggcataa atatgataac tatttccttt aatttcccat tggataaata 27600 
actgtttata aacatctagt tgccagaccc atgcttggag tctcagggct aaaagaaatg 27660 
ctccccattt gctctatgtg ctagtcacat caatcccact tgatccccac agcacccttt 27720 
gggggaaaga gagtagttga taatctgatt ttaccaatga acaaaggctc agaaaagact 27780 
agttgatcct aaaaacacca cacagcattc taatgttgta aaaatacctg gatttgagtc 27840 
ccctgtacac tgatgccttg tggtgtggca ttggctcaga tgcttccctc ctctgggatg 27900 
tggactggcc tctgaggtct ctttaacctc tggaatcatg tgagctggag cacctgttgg 27960 
ctggggagac aaaaggctgc tgcgtccaga tctcccttca cgaggaagaa atgtgtctgc 28020 
acttttcata tcaaagcact tgggcagcat gggggcgtat tgtgatgagg ctgatggatc 28080 
ctccctgaac tggtgcctca gcgttcccat gccaactgag ggaaggaaaa gaaaaaaagg 28140 
aaaaacaaaa atcaaacacg aaaactgagc ttgatctaga gctgaagaaa aatattctac 28200 
tgatggcaac attcatggcc gtattatgaa gaaaaacaaa ttcataaaaa cctcagcttg 28260 
gccttaatgg ttggatctta agatgagtga caaatgacga aatattgggt atgagagaga 28320 
ggcaggaggt gggaggagag caggccctgc aatcagaatc acttatttgt aacatccatt 28380 
cacgaataca ttgccctcag ccacgcagga aacccacaca ggcctagagg atgctctctg 28440 
cttttttcgt ggtcctgcca ttcccaccag tacccagagg cccgaaccag cgaggagaaa 28500 
acattcactc tttctctgaa tattcaattg aacttgtttt cagtttcctg acagcatcca 28560 
gatccccaga gtgcctgcat ggctttattt tttcatccac cgcctgctga ctggtccctc 28620 
tcttaatacc cctcctcccc cagggcttgc ttcttcctgt tttgtgggga gcagaacatt 28680 
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tgagcttcca ggggtagtaa ccactagtgc tggtaagcac agctttggaa atcaggggag 28740 
ggaccctcaa agaaccatgg aaagtcccct gggccctcag aaagggaggg atcaaattaa 28800 
gtaagggaaa cagacatgag atatgggcag atgggaagct aagtagctgt ggcagctcct 288 60 
tataaagaga tcacagctgg agatgcgttc actaatccca cagaataatc ctctgtttct 28920 
ctttcctgaa cccccacatc tagttcatca gcaagtctgt cagtgctagt tacaaaatat 28980 
attttgaatc cacacacttt tttcaaaaaa aatctctacc ataaccccgt atccctcaca 29040 
ctgtgatctt tttcccatat cactacatta acctcctaac ctgtcttctt gcctccttcc 29100 
agacaatgtt ccacctggag gctaaagtga ctttttaaac ctattgagct gaacattcca 29160 
ttcctctgct taagtaaatg tagactccct tccttggcct acaaggccct ccatgaacct 29220 
gtccctccct aaccttacct cttgcccctc tctcatgtaa ctggatttca gccttccagg 29280 
ccttccttca gttccttaaa caggccaagc tgccagggcc tttgcactgg ctgttccctt 29340 
tgcctgggat gccgttaaat ctcatcatgg ctggctctag ttcactattc aggtcccaat 29400 
tcccacgtta ctttctcaga gagtgttccc tgaacactta atctaaagta aatttctaat 29460 
tgccctctct tccagcactt caacatcctt caaagcattt tttttaaata gcccacaggt 29520 
gctactaaac tttgaagctt tttttttata attagtaccc attttgccta cttacttgta 29580 
ttatctgatg tcttttttta cctgctcagt gtaagcgcca caagaaaagg atctgcttct 29640 
tgttccttgt atcctgtgtc tagtctgggg ttagcacaga ctaagtgctt agtagacatt 29700 
tatcaagtaa ccacataaat gaatgaattt ctgaagtggc tatttctttc tacctggatt 297 60 
tgagctcagc cctggggata cagacatgaa aaatacacag ccactactct gaagctgctc 29820 
atggcctaat gagggaaact gagaagtcag tagtttcagg attaaaagcc agtagtttca 29880 
ggattaaaag tatgataatg tatattatta atcatgtgta tacatgtaca ataatagtaa 29940 
tagtggccaa acttgagcca ttatgtgctc agcactgtac aaatattatg caagtatttt 30000 
ttttcatata attgccacaa taaacttatg agttagtatt atcacattta aactacataa 30060 
aaatgcattt ggcagatgca gaaatggaga tactgcgagg ttcaaataac tgggccatgg 30120 
ggacagttag taagtggctg agtgaattca gcctggcatc gtgctccatg tcagagccag 30180 
tgctctgaag cactagatca gtgattctca aaagatgacc gtcctggact ggtagagtca 30240 
gcatcacatg gcagcctgtt agaaatgcca gttcaaggcc aggcacggtg gcttacgcct 30300 
gtaatcccag cactttggga ggccgaggtg ggcagatcac aaggtcagga gatcgagacc 30360 
atcctggcta acaccgtgaa accccgtctc taccaaaaat acaaaaaatt agccgggcat 30420 
gatggcacgc acctgtagtc cagctcctca ggaggctgag gcaggagaat cgctggaatc 30480 
gggaggtggc agctgcagtg agccaagatc acgccactgc actccagcct ggacgacaga 30540 
gcgagactcc atctcaaaaa aaaaggaaag aaaagaaatg ccaattcgag ggtctcaggc 30600 
cggacctcct gaatcagaaa ctttggagtt ggggctcagc actctgtttt aacaggcccc 30660 
gcagatgatt ctgaagatgc tcaccaaagt ttgacaacca ctacctattg aatgacttaa 30720 
ttttatggga tactagatca gactagatca gagtctgaag tttttctgta actcagttct 30780 
catcttaacg ctgtgagtgt agattgctct caatccacgg ctgaagaagc cagattgtga 30840 
ctgaataaag ccagttaccc tgcctgaccc tggctccagg acctgcagca tgtaaaacat 30900 
catccaggag tgcagcccct gtgaagacag ctgacctgaa gggcatggag tctgtgaccc 30960 
catccacctt gagaagcatg cttcaaccag caaaggaaag acagtcctag acaaggaaac 31020 
atcatcaagt ctctctgtag gaatacttca tatcagacca tattgccatg tgttgaagat 31080 
tattatataa tgataataat tagtacaata gtaataccat acattttgat tacatttttg 31140 
acattttcaa ggcatgttta tattctaata atctattttc cctcttgata attttatttt 31200 
ctcccttaaa aatctctcca gttacacaaa gtagagtagg agtgaaagga ttgttaatct 31260 
gttaatctgc ctgtccaggg gagaggcaaa gtaaagagat gtatctgtga tttcatggcc 31320 
tgttaggttc gtagtaacaa tagctaatgt gtgttaggga gcttaagtgg gccaggctct 31380 
ttgctagata tgtcagtgct tattaattta ataaaacctc tagatccaag gcttccagac 31440 
tcccaactgg tctccttcag caagcagcgt ttcttttctt tcacctcact ctgaattctt 31500 
cagactctct agtcattcat ttattttatt cattcaactt ttatttagct tatacatcct 31560 
acagagaagg tgaagggcac agaggtgaat gtgctggccc acaaattgct agctctctca 31620 
ggggaatcca gaacaatgca gtgccaccca cagtggagcc ttgaaggaca aaggcaaatt 31680 
atttaccaaa tcaaaagttg ccagattcaa gattctctgc agttctttga ggtttgagct 31740 
tggtgcataa gaagaaaggg ggattttgtt ctaatgaaag acaggacaca agggcctacc 31800 
tacaaaaatt gctgcaataa atgaggacat cattaaaatt gtctttgtgg gggcccatgg 318 60 
atatggaagt gagtagggaa gatgtataca agcaatggaa ggaataaaag acagaagact 31920 
tcctagatct agtgctagag gctccatagt tctgttcctt acaaatgctg taactttggg 31980 
ccagcctccc aacctattca accttctgct tccttctcca tgaaataaga acctggattt 32040 
cttactcacc tgattcacag ggaacctctg aaaatcaaac gtaagaatta gtaatagtaa 32100 
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taataacaat ataaatagct aacatctatt 
tatttcattt aatcatgaaa gcatttccat 
tagattataa aaacagaggt ttagtaaatt 
gaacaaggat ttgaattcag atctgtctga 
cccagaattc aaaaacaaat gaggaaaact 
tcaaaagagg gaacaaaagt gagggcctgg 
tggaaagcat ttccagatac catgtcctga 
ccctggacag tggaccaagg tttaagagtc 
cagagtaaca catgccataa acagaagaaa 
gcagagctgg agtcaggcag ctgcagattc 
aaaggtacat tttatttaac tccatgccca 
gaactacagc ttccacatct ggctcagtgg 
agaagcagct cagtcttgct gccttccctt 
gcttcccaga gccttttggg agtggctcag 
aatagcttca gatgtagctc tggtggttga 
agctgacatt cattgaactc taatcagggg 
cactgttgtg tgtcttgctg ctctcattac 
acattgaatc tgatttatgt gtgagttgag 
acatgtttta tagatattta aatatttatt 
tcagagtcaa tatatttatt tttctggtca 
actgtaggac ccagacactg agttttctca 
ttgtgttggg agctgagtgc tcccacacat 
taaactgcct taggtttgaa tcctgaaccc 
gtgccatatc ctatttgtgc ctcagttttc 
aatcagagaa cacataaaca gtttaaaata 
gttgttatta ttgatattta aaatgtccca 
tgaaaatgaa gtcaatttga tgaatggtgg 
gtagagctca tattcataat gagtaagagt 
cacagtaaca aatgaccccc aaacttccat 
tcatgctgca agtccaacac aaaatggtgg 
ttccttaggg acccaagctg aaagagggac 
cttatgtaat aagccacacc ctggctctaa 
tccacacaca gttcattgga tcaagcaagt 
gaaaagtgaa atcttaccat gtaactgcaa 
gaaggttttg aggatcaaat taattaatct 
tagaaaaatc ttctttaatg ttaggtatga 
agcagctctc ctgtataaat atcttccacc 
tctctctctc tctctctctc tcaatgatcc 
gacctctctc ttatcatgtc atctctcttg 
gtcctaagca agatatctga gggcaatccg 
ttctcccact tttctctaag gctgtgttct 
ttagactggc acattcctac cccatccctt 
cttctcaaat gcctcttgcc tttctgcaat 
tcctccaaga agcctcattt ttcaccacag 
ccacagtact actcatttgg ctgaacttta 
ttgaacattt ttttttttat ttatgcttat 
acagcatcag actcatattg atctaaattt 
tgtgcaggtc agccatgttc tcgagctcct 
tacctatttc ataagaatgc tgcgaggata 
ataggagatg actgtcctca tcatagctgt 
ataatagtca gctcaatgac tatttgttga 
tcttactgca tgctagtttt ctccaaaagc 
taaagttctt tggagtctta ggcaacggtt 
aatgttattg aaatgctagt agttggaatc 
gggccaggcg cagtggctca cgcctgtaat 
atcacaaggt caagacatag agatcagcct 
aaaaatacaa aaattagctg ggcgtggtgg 



gagcaattat atttcttact ttcactggct 32160 
gaagcagata atgttttagg ccccagttta 32220 
actaagttca cccaccctta acaagcagtg 32280 
tctttgaacc tattactatt aaatcactac 32340 
ctgcgagtca tgctgatgaa atgagcggaa 32400 
ggtcagaggc tgttagcttg gcatgaagga 32460 
gtgagcacaa tgcttaggct atgcagcaga 32520 
agtgtgaact ttcaggaaga agagtttatg 32580 
gtatcaggga ccagagaaac agagtaagag 32640 
caatggcctc tgactttctg ctgtagctgc 32700 
gccctgtggc agtctgggag ccctttgaca 32760 
ggaaggagaa atagatcttc ccgggcaaaa 32820 
gagtctggac agaaccatcc tcggtggtga 32880 
gggaccaatt tgcttccttg aagaattttg 32940 
agctaaggac aataatgaaa ataatatgat 33000 
cttgttttaa ggcatatttt aggctctatg 33060 
accaaagata ttatagcaca cccacatctc 33120 
tggcattgct gcacactagt tggtgcaatg 33180 
aatttgattt tacagtttgc ttttaatatt 33240 
gacatacttt ttaggcccac ccttcaaaag 33300 
tgccacctgg agaaagtgat cctgtaatta 33360 
tgcctgtgga taagtgtaag actctggaat 33420 
actactttct agctgtgtaa tctttggcaa 33480 
tcatttgcaa gatggttatt gtaaggactg 33540 
ctacctgtaa cctgtatcat ggcatgtaaa 33600 
gtgaggtagc cactatagtt attcccattt 33660 
agagggacat tgaactcagg tggtctgacc 33720 
ttgtatcagt caagttaggc caggttatac 33780 
gggctaatat aacatgagtt tatttcttgg 33840 
gggagtgggg agaggcattg ctcctcatgc 33900 
catcctaaca tgatcacagt gtcaaaggaa 33960 
agccttattc ccaaggtcac ggacatcatt 34020 
tgatggccag gcttaacttt aaaggaggta 34080 
aattgggaat agccaaatgt tatctcagag 34140 
atgaaattat caaaaatagg gcctagaaca 34200 
attcttatta attaatgaat tgcctcttgt 34260 
tgattctctt gttcctgata tctctctctc 34320 
atcttgcaga cctgatccat tttattatta 34380 
ttctcccttg tctgattggg taagttcaga 34440 
taattcactt ctaacctact ctggaatttt 34500 
aactggaagg actcattcac tgttcccaaa 34560 
gatattcttc attcttattg tataatctcc 34 620 
tttatatccc tcattatgca gtttaccact 34680 
cctgaaatga tttcattcag ggctgaactt 34740 
ctactcattt gaccctgaac tcagactgct 34 800 
gttgttactg ttgtggaaag aacacagagc 34860 
caggtctgtc attgactggc cttgacaccc 34 920 
tgcccctttc ctgtgacatg aaagtgatta 34 980 
gaattgaaaa caagttatag tgacctacat 35040 
gtccctagta tcaagaacag tacctggcaa 35100 
ctgaatgaat tattagtaat caccagccac 35160 
tgtaagttcc ttgaagacag ctttgaaaag 35220 
ctcaagctta attatacagg agaagcaagg 35280 
tggatgtact gaatcagaat tcctggcaca 35340 
cccagcactt tgggacgctg aggcaggcag 35400 
gaccaacatg gtgaaatccc tgtctccact 354 60 
ggctcacctg tagtcccagc tactcaggag 35520 
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gctgaggcag 
ccactgcact 
tggcacagag 
tatgctttga 
ggtagggctt 
tagctttcct 
gcattagaat 
gcactaggtt 
ttgattctgt 
gttggccaac 
gaacctgtaa 
aagatcactc 
tttggcagaa 
cttaggatct 
atcacctatt 
aaaaaagtca 
gtgcagtgat 
aatgtctgga 
aggtaaaagc 
ataattatct 
gtactcatta 
atgtaccaca 
tagttctttt 
cctggatggg 
gtgggaaggc 
gtggttagcc 
cctggcttgg 
actccactca 
actttgttat 
taatttagca 
agtcattttc 
tggctactaa 
tctgccctga 
atgtaccgca 
cagtctatga 
ctgcagcata 
tcatattcta 
ggtgatagat 
attaacaacc 
aatccaaagg 
gaataaagtt 
gctttattaa 
ctgaaacaca 
tggactcagg 
atgctaaatt 
ccacagggtt 
tagcacttgg 
cctagtgtgt 
tatgccttcg 
atactgaaat 
tgaaaaacac 
tggggaaggg 
tcatattagg 
aggagatgca 
ttgactggat 
tgtgtttcca 
gtggcaggca 



gagaattgct 
ccagcctgga 
tctaaacatc 
gaaatcttat 
tgtaagacaa 
gcatagctaa 
tgaattaatt 
tgaaagtgaa 
ctttgtaccc 
aaactttgtt 
tatagtaaga 
attaagtcca 
ctgtctttgt 
cttaaggatt 
gaattccagg 
actcatccaa 
tctcagctgg 
gacactttta 
aatggatgtg 
gggctaaaat 
actatccccc 
aaattaaaaa 
tttggccaag 
ctactcagag 
aaaacttggg 
ctgtagctgg 
acttctcaag 
gtaagttgga 
ctgtgatcat 
acagt cactc 
ttgtggtgaa 
acttgctcat 
ttacattaat 
gttgctcccc 
attaacactt 
tcagcaaaat 
agcttatact 
tgcagaatgt 
tgtatcagtc 
aactctgcaa 
gtgtgaatgt 
ttcatgcaca 
aagagggtga 
caaagctgat 
ttaagagcct 
gtttgtgtga 
tgggtgctca 
aaatagtgaa 
tgatgctgat 
cgcaaaaatt 
aattttgaaa 
attagtgtat 
cagaactctt 
tattggagct 
taaggaatat 
gaggagtgtg 
cccttcaatc 



tgaaccttgg 
aacagagtga 
tgtatgataa 
tttggaagag 
ctcatctata 
ctcatttaat 
tgtgttcact 
gctggtcaca 
agtgtcttct 
gatgcattga 
tcctacaggt 
tctcatagtt 
ataaatagtg 
cgactattta 
tcttagcagt 
ttgttaagac 
gagtgatttt 
agtgtcacaa 
ctaaccatcc 
gtcaataatg 
tcaaaacatc 
taaaaaagag 
tgctgggcac 
tgttccttgt 
agataagggt 
aatacaaacc 
atagttttcc 
gagtctctgt 
ggtatataat 
aacactgtaa 
aaggtcctgg 
gggcctctgt 
gatggagagg 
aaaagtgaaa 
aggaatcact 
gaggccaata 
ctttttttgt 
ttggttgtgt 
ccagaaatgt 
agtctcctca 
aaactggatg 
atcatctgca 
atatgtgcac 
gtccctgctg 
cagtttcttc 
ctaagtgagc 
gggttgtttg 
tctgtgacat 
cataatcttg 
caaaatccct 
gattaaaatt 
ttttggttgt 
atcctgttat 
gacaaggggc 
ctaaaacctg 
tgacctaagt 
agcagagggc 



aggcggaggt 
gactcagttt 
caaacttccc 
ctcctcttga 
aggatgaata 
tccaggctag 
taattgatat 
cttttttaca 
gtatagtatt 
aaaatacagt 
ctctgtagga 
gattgtagtc 
aacagataca 
cttaactgag 
ttattagaag 
gatttatctt 
gcccctttcc 
ctcaggactg 
tgtaatgcac 
ctgaggttaa 
tcatgtaccc 
tataaaatta 
tggttccctc 
tcagaatcaa 
catctcaatt 
acaaccctct 
ctcctcattt 
gggtatacaa 
aatcaggatg 
aagaaaacta 
gtttgcctca 
ttctttatct 
atctataaga 
cttgggattg 
gcattattat 
tagtaaagca 
tcaaatacct 
tttaaaacaa 
ctttaaatat 
gcaggaatgt 
catttggtat 
ggaggcagat 
agggcagcaa 
agctcagcca 
atctgcaaaa 
caatataggg 
tttcattcat 
ggaggctctt 
agaacacaat 
aaagtctaaa 
gcaaatcttg 
acacaggata 
tgtcttcatt 
agacatgtgg 
gtaaaacatt 
ggattaggcg 
acaggaagaa 



tgcagtgagc 
aaaaaaacaa 
aaatgataga 
ctggatacaa 
aactttctat 
atcacttcta 
agtgtgtgct 
gtaggcttct 
agctcaggga 
ttcctaattg 
aatattgcta 
cagtggaaat 
ttccatctta 
gtacagccca 
tgtaaacaat 
ttttctttca 
cttcctgggg 
gagaactact 
agaacaaacc 
gaaacctcat 
cataaatata 
aaaaaaaaaa 
tgctatttca 
cttcagctgg 
tctcctttct 
ctcctgattc 
ccctggcttg 
ctggcacttt 
aaagtcaagt 
ctcctgtgga 
gtgggtctag 
gtatgatggg 
tggctcctaa 
ggaaaacttc 
ttgaagaagt 
aaatgtatat 
tttctattat 
cttacatgaa 
ttttctgatc 
tatcatggat 
gttaaaaata 
cctatttgcc 
agcaaaaaga 
cttatcagct 
atggaaatag 
aagaggctag 
ttaaattatt 
ccaaatttaa 
cctgaactcc 
atttccaata 
aaatcttgca 
ggtgcatcgt 
tggaaattaa 
gcttaatttt 
acttttggtg 
gggaagatct 
caaatacaga 



caaggtcacg 
aaagaattct 
gtctgaagat 
atcttttcct 
ccttctggac 
aaaaatcaat 
tttctaaatg 
ctaataacac 
tttgtacgtt 
caactccagt 
ctgaaagtca 
atttgctata 
gcactgccag 
tgggacacac 
gcttccattc 
tttcagccta 
gacatgtggc 
attgacattt 
cctacaacaa 
tctatttttt 
tatacctact 
ccacctagtc 
ttgtgtggaa 
cttactctta 
ttaagcacta 
actataagaa 
gttggaacac 
gattgccgca 
ttcctattag 
aattaaacag 
atttcagcct 
aataataatt 
gttctgtaaa 
tacacaatcc 
atatttataa 
tttaaaggac 
gcaatgttat 
aatatcaaac 
tttgaaactg 
gatacattgt 
ctgttcacat 
cgggatcaca 
tcagcagctt 
tcatgactct 
taatactacc 
atagagagac 
tccagtatcc 
tccagtgcca 
gcaatcttga 
gctaaattcc 
aatcaaatcc 
gttagttaca 
ttatggttta 
aggtgtacac 
tgtctgtgag 
gccctcagtg 
agtcaaattg 



35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
38580 
38640 
38700 
38760 
38820 
38880 
38940 
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gtctctctgg cagctggctg gcttttcttc tactgcactg gacatcagaa atattccaaa 39000 
aagtgtagtt tcacaatgtt cactttctcc ataagcatca tgtgtgtaca tgaaaacatg 39060 
gaaacttcct taataaatgt agagatgtct ttttgtatat cttcgttggt gaaatggaac 39120 
atttcttgag atctcagttc tgtgagcaat tgcaagtgtg gtgatgaccc attgcagttc 39180 
ttgatcaatt tcgtcaaaag atttaggttg tccatcatgt atttcagatg attgcagtag 39240 
taaagcttag tgcacacaat taccaaccat agttatatac atttgtaaat tttacttttt 39300 
gacttattac tttatgagta cagttcattt gcacatcttt gttgtaacca tgaaactgcc 39360 
attagtacac ctgagtgttt atgcttgcga aaatgtgtat gttattattg aatattttat 39420 
tgtatagagt ggactatgtg tgttctttgc atttttattt ttctcaaaac atatagacat 39480 
tattttatgt ttctcaaata aaagaaacat tgccctttta aaatgtgaat aaatgtcttt 39540 
taaattttct ttttattatt ttttccagaa tatgtttttg ggtttttgat cttttgagat 39600 
ttcaacattt gggattatgg tgtttgggat tgtgtctttc aggattatag ccaaacccat 39660 
ttcttctctg cctctgaaca tctgcaaacc cactattcta gtcctcactt ttaagggtac 39720 
aggctcagag gactaaactg aggttctcaa gcttttatgt gcatcagact cacctggaaa 39780 
acttgttaaa acacagcttg caaggccaac cccagaattt ctgatttggg aggtctggtg 39840 
tgaggacaag aatttacatt tctaccaagt tccaggtgat gctgatgctg ctggtccagg 39900 
gaccaccttt tgagaatcac tgagctaaat gtttctttct cttttcagca acatttcact 39960 
cctgtggttc ctgactgttg ttggcagttt tcccacaatt ttccaaaact ggtcttcatc 40020 
cagtgtctcc tctcatagta gacatcacat tcatccagaa aataaggcat tctcctatat 40080 
gccttcttct ctcttcctcc acgttccttc tgtcttcagg cattgtctca ctggtccctg 40140 
gcccagtctg tcaatcagct gacatcactg cccccacttg ctttgaatca tgcctcctcc 40200 
cttaatgtct atgtgtcctt gcataggttg gttaccagag tagatataca aagaatgaat 40260 
gagtctcccc atctttccga cctcaagaac ttgattagtg gtatcctgac tggaacaatg 40320 
gaatatggcc ttttgtctgc aaacagaaac aattatagca ggttttctca ctagggtcat 40380 
gtttgcctat tttatgttaa taagagctgt tgtattgact tagctgtctt tattctagtc 40440 
acttctgctt cacaataaga ggctatctgt gcttgcgttc cagggggtga gttactgtct 4 0500 
tatattgtct gacagggctg tcccttgagg aatatctttc cccaatgcca gttcttgtag 40560 
aggggccagg aggtcagaac ttaagctaac aaatgattat ctttctcagc agaattacca 40620 
gaggacctca gactcctggg ttcctcctta agggactcag gttcagatct ttggcccaga 40680 
acacactgta acctctgagt tcaggcttta gatgaggcac ctgtcaattc ctatgggacc 40740 
tcaacccagt ctcaaatact tggtcaaaag gtgagttgaa acttaaaaca gtaccagccc 40800 
ctggacagta taagttggtc tcatcacatc gaagtattca tgatgagcat cataaaatct 40860 
tctgagagtt ggttttgatg aacagataag gaagaatacc aacaccctaa aagttgaggg 40920 
caattcaagt ccagatgaga agaaacgtgc aggattgcag acttggaatc ccaaagaggc 40980 
acttagtaat ggttataaac tctgagggtt attcgtaact gatgcctggc cttggggtaa 41040 
atggatatcc tgaggctttg ggcagaacag ttgcaaactc cccaaccttc tgcttttctt 41100 
ggaggtgatg tttgtatgaa aagagcagtt ttctggaatt cacacaataa aatgagccct 41160 
ctcatttcag gccctctgct gggtaatctt gggcaatatg cttcccctct ctgagcctta 41220 
ttaccctgtt ccctcatttt cgaagcagat tctaacctcc gatctgcttc tgtggtggcc 41280 
tcactgggat attctgaagc tcaagtgctg ctgctgctaa taataataat accaattact 41340 
atagcaacag caacaactct aactgtgagc acttgtgctg tgcccagctc tgcggatggt 41400 
ctcatgtgtt ccatcagccc gttcttacag aagttctggt aggtagttac tatcattata 414 60 
cccatcccat aggcaaaaaa actcagccac agaaaggtta aatagcttgt ctcaggtcac 41520 
atagctaata gtggtagagt ctgtactgga acccagcact gtcctgcatt taactgctat 41580 
ggtacactgc tctgtagatg ttaaaattac ttttggtgaa ctttaagtaa ctatcaatgg 41640 
aagatgattt ctttgccatg cgacagtttg ctctccaagc tttaagcgga tcaccaagga 41700 
gtctaactcc tctggccctg actgcctcca tcttggtttc ttgccctgca tgggttcatc 41760 
ttttctttgt ttcccctctc tggcattatg aaaatgagaa gcagctgtga gccctgcaga 41820 
ttgcaaatgg gattgtgagt gttgctatct ctctaacaac taattcatta cccagtggca 41880 
gatagtttcc gctgtgctgt tacaatctat ttgacttcca ggtgccacat ggaataaaaa 41940 
acaatttgtt caacagttgg gcctgggtgt ttgcattcag acaggcatgc tttacctccc 42000 
tatgtccttg agggctgccc tctctggtaa ggagccttct ttctgggcag gaaaagctca 42060 
ctgcacaggt gatagacact gaaggatatg tctggacttg gaagtaactt gatctaacca 42120 
gccatacctt tcagaaaata ttccaaaaat agttactgaa cacctatgat gtgccagtta 42180 
ctgtgccagg agctggctat gcagtgacag taagatagac gcaagctctg ctcttaaaat 42240 
aagcacatct gcagataact aatagcagta acttcagata ataataaagg tcagggggaa 42300 
aataagactg ggtgatgtga cagagagctg ttggtctgga cagggacaga gagggatggt 42360 
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tgaagaagag atacatgagc taagatgaga atggtaagtg agagttggct gtgggataat 42420 
agtggggaaa aatattctag gcagagggaa cttcaagtac aaaagttttt aggtgggtac 42480 
aagttgagat ggttcaagta atagaaagga agcccacatg ggggttggga ggtggtgagc 4254 0 
cctagaggta gagtgctagg agaggtggtt gtagggtcag tcagggacaa atgatgcagg 42600 
gccttctaga ccatgggcct tcctggggat cttctctttc agccctctac gggcccagcc 42660 
aaccaaggca ggaactgcct tctgtgtatt catcatatct ctgaagtcca accccaaacc 42720 
tctccaaggc caagtctgtg tctttttctt tctaaatatc ctcacccaaa tcttacacag 42780 
ttcttgacac atagcagatt ctcaggatag cctgatgact tcaaggtata agttaaggaa 42840 
gatattgcag accacagata tattttattt ataaccaagg aagagcttat atttagcaaa 42900 
agtctgcact tcctcagtag gtactaagct ccctgttact agaggtacat aagcagaaga 42960 
tggaagatca gttggtggga atgtattgaa tagaggtagt agtcatggtg ttttgcaata 43020 
gagatagcag tagtggtcac aatgataaaa atagtaatat tcctatgtat taaacagtca 43080 
atatatgcca gaccctagat gtatattatc tcatttaagc ttcctaaaac aacctcataa 43140 
ggaaatacca ttattatttc cacttccagt cacagagact gaaacctctc acatttaagg 43200 
actgtgcacc tggggtcagg tagtgagtaa gtggtagggt ccacttgaaa acccatactt 43260 
ccagtccata cttgaaaacc ctctctaggg aaacttaatg attggagtcc atattttgac 43320 
ttcatgtatc ttggcttttt atttctatcc agagctttat tgcagggtgt agggggtgtc 43380 
actcttttac agcctttaca gcccttttac atatattagg taataatata gaattgaatt 43440 
ctaaaatagc tgaaagtatg ttttttcctc cagcaaaatc attccccaaa gatcatcccc 43500 
aaactgatga gtatcttgtt cctgtgagca ttactgtact gatctttctc ccgtcatgta 43560 
gtactttgtt atttgagtca tttcagctct tcagggtact caagagtagg tgctgttgcc 43620 
ttacactgtt taagcattct catagacctc tgaatttctt cagacctcac actcactcct 43680 
tgttgtgatt taatatctgt cttcccaaca gggactcaag ctatatcccc agtacctagc 43740 
aaatggcaag tggcagacac ccaatagaca ggctgaatga atgaatgcta agttctgtct 43800 
atcatcttca taccatggct gctcccattt agttggaacg tgtgctcttc agtttccaca 43860 
aaccccacag cagtgtcttg gactcctcaa cattgaggat gtcatgtatg tgtcatttat 43920 
tattatcctt gcttttgtga cacccaccag gctcactgga tcacattgtc accctagccc 43980 
ctgtgggcat ctggatttgg gaaccttaac ctgaggatct tggtaaaacc tcctggtgtt 44040 
tccggtttgg gaagtaggct gtagagtcaa tacctgcggt caaacatcag ccctacatcc 44100 
tcactaattg tgtgaccttg agcaagcccc tttacttcat ctagctaata aaaaataaat 44160 
aaataaataa aacagtcacc tctgcatagg tgtatggcaa ggactgagtc aagaaatttt 44220 
taagaacact tagcccagga cctagaagat tctagtaagg gtttagtaaa tgttaattat 44280 
ggttgttagt aacatcatca aatttttaaa atattatcac actgactttt tgtgaatcta 44340 
gacttgctag ctctggtgtg catccaatga tacaaacccc catattcaac attcctcttt 44400 
gaaaggtttc acagaatgtt gtctgagctg accttaaaga taccttgcag ctccaaagtt 444 60 
ctgtgacttt caccaggagt ctcaaggcaa atgcacagaa gggtcaggca agtatcgtga 44520 
atggaggctg gaccagggag cccatgcatt cttccaaagg catttcaatt cagtcttgtt 44580 
aaagcactga gttgcacaaa agaaacgtca ctgttggcca gcctcggttc ttcggctagc 44640 
tccttcaacc caagtgtatc atctaggatg gaggaggctt ctgagggact gagggaggac 44700 
aaatcttgga aagagaggta caaggaagag atgggatttg gataaggcaa ggaagagcca 447 60 
ggaggctttg gatgaactga cctctagata cattcatttt tatgccaaaa gtccatttcc 44820 
cactctttaa accccttcac ctcttctcaa gaaacttgtg tctcagcttc agagagccat 44880 
ggcctcattt aaaatgttgt gaaagaaggt gatggaagca tcaggttcct aggctggtga 44940 
atttttattt ttatttctcc attgacacag tttaaccttt gcttttggca gctagcatat 45000 
tgctcaaata aagtgaagag aagggtgtgg gggagaaggc aagagatatt tggctagaag 45060 
gttatgagaa tcacaatagc tgtcatatga gtgccgagca ttcttctaag cacatcccat 45120 
gcataatctc atttaatgct cacaatagtc ctctgaagtc taatgtctat ttagaaaagg 45180 
gaagcttatc tgaggtttag catagaataa ctcacccatg gtcacctatg tgaaaaaagt 45240 
gcagagctaa gaatggaatt gaaatctgtc aaactccaga gtgtagagac tgaccaagta 45300 
gaccccccac ttccaggcca atgcatttcg gcacttgatt tgtggcaagg agtcctctca 45360 
gggttttggg gctatgctgg tgcttgctat gctgcctcgg aaatgtcacc taacctagaa 45420 
atagtgttta ttctgttaca aagcaagttg gaagaaaaga aaacactcct tttttctttt 45480 
ttctgtgcct cctctcctta ctttcccctg aaacttaagt tccaaggagt gcaattagca 45540 
ggtcagcttg tcttgctgaa cagtcaggtt actttccctc agtatttgat gccaagtgaa 45600 
tgttgagctg aaaggctggc tgataaatgc ccctctgggg agaaggaact gtgaaatagc 45660 
ctggatctat tggcagatac tagggccaaa aagtccttag aaactcaacc tgaaaaaaat 45720 
gtctagggaa agcaaaacgg ccaactaagg tagtaaacac ttcagaaaac cttgataata 45780 
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gcaatttgga 
gatacagatg 
gttgctcaca 
cttggtatga 
ttcttggcca 
aatcaaggag 
aagggcatat 
ttttgttttg 
ccaggctgga 
gcaattctcc 
ggcaaagttc 
aactcctgac 
tgagccaccg 
taaatagatg 
tttacccacc 
tagcctgttg 
gcaacctcct 
tctccagtca 
ggacatgtgt 
tcaagttggc 
ggtcattaga 
tacctcttgc 
gaccaagaga 
ggagctgtta 
accgacagca 
caattcttgc 
ccacacctgg 
ctgtcccatc 
tgtgttctaa 
actcaatttg 
gacttttaat 
aatctcccac 
gccgtacatt 
agggccttct 
tcctaacctc 
caaagtgtgt 
gatccattgg 
tttgtgctca 
taactaactt 
tccggtcaat 
ttgtggttaa 
taacagacag 
aaatgaaaat 
ccagggaacc 
cattccaaag 
aactgcataa 
agaggatcat 
gagaccagac 
tgcctagcga 
ccctggctct 
cctgctctct 
tgatgtcttt 
agcaatctac 
gtcagaacca 
cagccagtga 
tactctcagc 
tttgtttgtt 



aaatcttgca 
tcttgcctga 
gaaagacagt 
gcctcagggt 
tctatgaagt 
tcagataaat 
ttgaaaaagt 
ttttgttttg 
gtgcagtggc 
tgccacagcc 
tgtatcttta 
ctcaagcgat 
cacccagccc 
tgaacgtgcc 
atccaccacc 
gagaaatgat 
ggcacatatg 
gctgaacaag 
ttaggtgagt 
actattaggc 
tgtacatatc 
actccctaac 
cttttgattc 
cctctactgt 
gctcctaact 
tgtcagaatc 
ggaaatccag 
tcaattgtcc 
acaatagcca 
ccagcaatgc 
cgtccctcag 
ccctacctcc 
agcaagattt 
tgtgctcatc 
ctccagccca 
tgaatagaat 
gcagatagtg 
tgttaaattt 
gaggttatgg 
tctctcttct 
agtttgaagt 
accttgggtg 
attaatgcta 
ctgggataat 
atacctgctg 
ggaaagagat 
aaacgcactt 
cctgcacatc 
agggccccgt 
ccctctctct 
gtgtctcgct 
agtgtgaaag 
ctctgtaatc 
gggtctccag 
agttgcaagt 
ccttttatct 
tttaataacc 



tgtacagttt 
aaatgtaact 
atcacgttgt 
catcattacc 
tggattttaa 
gtctggcccc 
gtattctaca 
ttttgttttg 
acaatcttgg 
tcctgagtag 
gtagagacag 
ctgcccgtct 
aacacaggcc 
atgtaaattt 
attgtttatc 
ctgaggtggt 
ataggccaaa 
caggtgctag 
aacgtgaatg 
atccattcct 
caaatccaag 
tcagtggcag 
tttttccctt 
ctccctctaa 
ggcttctgcc 
gtccttttaa 
cctccgtagt 
ttccacaccc 
gatttccctc 
tctctggttt 
ggccaagttt 
acaagaatta 
gtatgcatga 
tgtccatctc 
gcatctaagt 
ccattaattt 
aaaatttcaa 
agaaacattt 
agaaagggca 
ccatcagtat 
ccctgagata 
agttagttac 
cccattttgt 
aaaagcaatc 
tcccctctca 
gatgtctgct 
tgcaaagcaa 
aaggcccagt 
ggagcttggc 
ccccacctcc 
gcctgggtcc 
gtgattcaca 
atgtctgtaa 
tggcttcaag 
tagaggtgaa 
gcaggttgat 
actggaaaac 



catttcataa 
ggtaaattta 
ggagtaagtg 
agctgtgtga 
ctccccaacc 
tgtgggattc 
cggtaacctt 
ttttattttt 
ttcattataa 
ctcggattac 
ggcttcccca 
cggcctccca 
atttttaatg 
agaatgtcaa 
cagttgttta 
cagaggtatg 
accctatcac 
aaaagagtgt 
tgaacagttt 
aacataaaat 
atcagtacaa 
tgccacaatc 
atccatgtct 
tctgtcacgc 
accactgctt 
aacaaatcat 
gtgatattga 
ctcccctaaa 
atttccctcc 
ttttctttct 
agatctaacc 
gccttccccc 
ctgttagact 
ctttctctct 
gcagtacatt 
gcattccatt 
ctaacatgat 
tgtttcattc 
gaagacttat 
tgccagtgcc 
agactatctg 
tctttctggg 
agagagagat 
ggacatcttg 
acgccataca 
ctcctccata 
atgccagctc 
ggaaaactaa 
tggtgagagc 
gcagcctccc 
ctttctgcgc 
gaaataaatt 
gggacttcat 
atagataagt 
gatggaggtg 
aaaaatcaat 
taagacttgt 



acccctccag 
cagataataa 
cttgactgag 
ccctgaagaa 
ctaatccacc 
atttcactgt 
ctagagaggc 
gaggtggagt 
cttttacctc 
aggcatgcat 
tgttggccag 
aaatgctggg 
ggttgttagg 
acacgtacag 
ttcaattgca 
gcccatatct 
ttgggatttg 
aaataattca 
ttatctttta 
agtgtctatg 
ttttcacctc 
cacctaatca 
ttgatctgcc 
tgtcaatggt 
acccactgcc 
ctactgtggc 
aggctcagca 
ttatgccaca 
tgtttctgcc 
tcatccctgt 
cctaccaata 
attatatccc 
gacagctccc 
tcttcctctt 
tcacataaga 
tctttggaaa 
ggtggagaaa 
tatcacctct 
ttttctactt 
ttcgttgaga 
ggttttaata 
cctcagtttc 
tccacaggcc 
ggaaccaggt 
gtgtgtgcca 
tccctggctc 
tggaattctg 
tgatttctcc 
attcttcatt 
agtcaagcta 
tgctgcctaa 
gcattgtgtt 
aatagtgtga 
gctgcaacca 
ctgctgctat 
caaaacatga 
ttaatagagt 



tcatgggatg 
tgtattggta 
actaaaaggt 
gtcaggcatt 
agccattctt 
cttcagagaa 
acaggccttt 
ctctgttgcc 
ctgggttcaa 
caccatgcct 
gctggtcttg 
attgcaggca 
ataaggtggg 
actaatgcta 
aatggcttct 
gtcaaataaa 
tgaacaacat 
acttgtttca 
tttcttggtc 
aatggcagct 
caccttgtct 
cccagaaaga 
accaagttct 
ggcaggaaat 
aaatccatcc 
actccttgtt 
gactctgacc 
aattcttgtc 
ctctgaccaa 
ctcctctcaa 
agctttcctc 
ttatcattct 
tttgggggaa 
gcctcctccc 
gacaccccag 
tagcttttgg 
aaccttaact 
gctaacctct 
ctttaatatt 
caggtcatct 
ctagttatgc 
ctcaaccata 
tagttggtgg 
aatctcctaa 
gtagattata 
ctgacacata 
cagcagcctg 
cccgcagacc 
ctccgcatgt 
ttgtgcatct 
gcattgtctg 
ttaggtccat 
gggccttggt 
aatgcacatt 
gaagcaacca 
gatgtggttt 
ctcagccaac 



45840 
45900 
45960 
46020 
46080 
46140 
46200 
46260 
46320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
468D0 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
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agcttgtgct cttaccagcc ctgtgattag acgaaaggga aagttcaaag tgtcacctag 49260 
aagggggagg caccaaagaa gaagaggcag ggaggtgata cagtgaaagg caaggaggag 4 9320 
ctgggctgga atctgagaaa cctgagactg atttgttcta atcatcatcc tatgtgatgg 49380 
tggaagataa gaaccacaga ttctggaagg aaattgatgg catagactca acaacagtgg 49440 
gatgaatatt tatacacaat aaaagggggg caagctggtt ctcccaaaat atttagaaat 49500 
gatgggaagt agggggaatt gtgctccctt gatcacatta ttcagaagtg actgcattct 49560 
tatggcattt ttaacagttt attgagatat aatttatatg ccatagagtt tacttattta 49620 
aagtgtatat ctcatgacat tttaaggtga tatattgtta agtctatgac aaaaagattt 49680 
aaaggaagca atgtgaaaca aagaccacaa gatgagtaga gaggctgtca aaggtggagg 4 9740 
ggtgtttttt taatgtgtct gcttccccag caactggtag atctgccctc atcccacctc 49800 
ccaccaccca aggtctggct atgcctgcag gttcactaca tgaagtaaga ataggtggct 4 9860 
cttagtcagc tcacaggaca ctcacacagc taaatgctag gaatccctct gggaggtcta 4 9920 
ctgcaatctt ggaggtttag aatttgttcc gcttgaactt tcagattatg agtcccactg 49980 
cacagccacc cacccaccct ttttgtgtga cacttgcgtt agcacaacat gtccctattt 50040 
ctccctctaa ttaggtcttt accaattgat ttagaggcca tgttcagttt ccataagcaa 50100 
tcaggtacat cctacaggtg cgttcatatg gtatcatttg tcctcattcc tctggatggt 50160 
tatgatcctc aaaactacct tacctgtaac ctatactaaa atatcttaat cctagcatgt 50220 
gtaattccaa tgaagtcctt cctcttgaaa actactcttg gttctgttct tgaattattt 50280 
tctgtcaata tcttccttgg aatgacactg gctgggtttg tgaactaatc aaacctcatg 50340 
aaggggtaac tagctggcca gggctgagag aatgattaaa ttaggtcact gtttcctaaa 50400 
cacaaccctt ctcctatctt catgtattta tcatatctgt gcatttactt atattttcct 504 60 
ttaacaaaca tattttaatt aaagttattt aaaaaggaaa catatcaact attgcaaata 50520 
gaaaaaccag tatcactatt aataggtaga agctataaaa ttaaaacaaa gcattgttat 50580 
tatattctag ccatatgcca ttgcattcat agcgttctga gcctgattct tgcctctttt 50640 
tgtttaaaat aaaagggaaa tggcaaggta tggagataaa taatttacca aacagatttt 50700 
ctttttcact taatagcagg aataaaagga aatttaaaat gaatgtgcat acgcattgtg 50760 
agtctgttgt ttaatgccat tctcacattg ggaaattcca ctcaccatac ctgggttcta 50820 
gtctttactg agcaattaaa ttgctatatg gctagctcac cttcttgtac ctcaatttgt 50880 
acatatgtag aatgaggttg gtgagttgaa atacagaatt cctttaaacc caagtttctc 50940 
aatgttgctc acgggcaagc tgcacctgaa ttttctgtgt aggtatactt gttgaaaaat 51000 
gttaatttct aagctccatc caagatattc tgaaaaagaa tttctggaaa tttttcctgt 51060 
ttttatcatt tgtccagttg accatgggat attataagat tcaaaccatc accttaaagc 51120 
ctgtatgata aactgaagag tagaactgaa ctgtgaatag gaatcaagag gtctccgttt 51180 
cagcagcaac ttacagtgta agcttgcgct cagctaacca ctctcatttc tccatttgag 51240 
cagtcaggtc acattcggaa ggctccaagt cccttccaat tctacaaata cagtgttggg 51300 
ttgctttgtt caaagttttc tctgaaaatc tgtgctgagg ccaggtgctt ggtaaggttg 51360 
aaggtagaga tggccaaatg acagctgtgg tctgtggggt gcattgtccc ctctggaaag 51420 
ccagccagtg cacgtggaga cgagccagag ctgcctctgc ctgccttgcc ttaacagcat 51480 
atatcctcca gcagggccag ccagatcggg caagcatctg cagaaaggct tagcgagatc 51540 
tcttgcaaag tcagcttagg ccctgctgga gagacaagaa ggaagacttc tatgaatgct 51600 
aagtgggtag aaaggaaact acagcagggt ctggcttgga ttaactcttc atatcctgga 51660 
tgccctggat tgagatctga gtctatgaca tgtgtgtctt tttcaaactg cctcctaatg 51720 
aaacacatat atccatcatc cagagactaa aaactaagtc tcaacctgaa cttcatggcc 51780 
ccccagagca ctcctagcct gtcctctctg cctcaactcc catggttggc caactggttc 51840 
attgatcaac tgattaattc atccaccaat tatttttgag acaaccactc tgggccagac 51900 
cctatgttag ttactgagaa tatacttgtg accaagatga acaaggtctc ttttctcatg 51960 
aaatttaagg gaaaaaataa acaaataaac gggacaatcg cagatggggg aaagtgcttt 52020 
agagggaaag aataaggtaa agcgataaag actagcaaag gggaggcctg tcaaaagagg 52080 
ggacatatga gacccaaaaa aggagaaaaa cccagtccta caaatattgt ggtaatgttc 52140 
caggcaaagg gaagaatcta tacaagcctc tgaggcaaca aaaggcttga gtagctttgg 52200 
gaattgaatg aacgttggtg tggctggagc taagtaagat aaagtaagaa tgatatgaga 52260 
tgatgtagga gaggcaggca gtacccaaaa cctgcaggac ccgggaggtc acaattccca 52320 
tttccaggct cttctcttca gcctaactgc aatacatttt ttcttgagca tacctctact 52380 
ttcccactgc cttcattttg catgtgccta atgcttttgt ctggcataac ttggccccta 52440 
atcacagtat aagcaaattc ttcctccatt ctttcgtacc cagctcaaac attacctcct 52500 
tcacagagct tgtcctaaat accacattta tgctgagtag acatgagctt ttccttcttc 52560 
tgaccactca acttcccctg agcttcattt actctttctc ttttgttctc attcccaaga 52620 
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ttaactttca gtctaaaagt tccttgtggg aagaggaaag gacctaatat ttgaaggaca 52680 
ctatttgtgc caggtgacat gctaggagct tcatatatcc catttcattc atgactatag 52740 
tcttgtaaat taggtgttat tagcctcatt ttagaggtaa gaaaccagtg ttaaattact 52800 
tgaactacag cacaaaacta catgatgcag gtattaactc taaattccct cagtatacct 52860 
agcattgccc cttgctcatc attagtacta atttctcgaa gtgactatac actctggcaa 52920 
cttctaatcc tcatttaaga ccaagaaaag gccggacact gtggctcacg cctgtaatcc 52980 
cagcactttg ggaggtcaaa gcaggcggat caagaggtca agagattgag accatcctgg 53040 
ccaacatagt gaaaccccat ctctactaaa aatacaaaaa ttagctggac atggtggccc 53100 
acacctgtag tctcagctgc ttgagaggct gaggcaggag aatcgcttga acatgggagg 53160 
tggaggttgc agtgagctga gatcacgcca ctgcacacca gccgggtgac agaacgagat 53220 
cctgtttcaa aaaaaaaaaa aaaaaaaaaa caagaaaagt gttgccaggt atggggcaca 53280 
tacctagaac tgtggccatt catcctgcaa cttgaacttt gggcatcacc ttgatccctg 53340 
atacctactt ctcttcatcc caatctgctc tgtccttctg ggccagcctg ttcttcctga 53400 
tgataaataa gtattgaacc tgagcctatt ataccccagg cctcagctga agtgtctatc 534 60 
ccaaagtgct tgatgttcag agagtagctc taaccaaact ttccccatcc agaaggatat 53520 
ccaaagacaa actcattgtc tataacacca agagatagaa cccctataag tcaaacttct 53580 
atgcctcccc tacattctcc actttgtaga gagatattca aatcctctct tttcctgtta 53640 
attactgagc agctgtggaa tcagcatgga tcacaattct gccttccttt tgaaggaata 53700 
acatgcaagc aggagttgcc acattgtgat catggactta cttcagatgt ataataaatg 53760 
acatcccatg agaatggctt gacagcagct gatggcactt ggcaaagcat tctttatttg 53820 
ccataggatg aaattatcac tttactaaga gtaagtgttt acaagcaact ggagatcagg 53880 
aactagagaa aataattcca tgtcccagat tgaaggttta cagcagatat acagtggagt 53940 
gaactagcaa tcacagagtc tagtctttgc tctgccattt gctagctatg aagccttggc 54000 
taacttcatt tcattttcta tgattcatct catggtacct cctcctagaa gatttcctta 54060 
accacttcct cttctcccag tctggccagg tggccctcct atgtccttcc atagtgcctt 54120 
tttttttcat ttatcacatt atattataat agttatataa acactttcgt ctctcactaa 54180 
attatgtgag taaagtatag tgattaagaa caagaatgtt ataatcagac agatatgggt 54240 
tggaaccagg gtttggctgt catctagcta tctgataaaa atacacttct tagagcattg 54300 
ttataaggat taagcataca taaagcaatt agcatagagt atgggtcata actagtaaat 54360 
gttttgcctt ttatttttgc ctttatagcc acagtggcta acgcaatacc tagcacacat 54420 
ggtaggaggg ttggtgatgg tgatggagat gtgatggagg tggtggtgat cctggtgata 54480 
cctgagtgga ggtaatggtg gcaatgttgg aatggtgaca atgttgtaat cccagtggta 54540 
atattataat ggtggcagtt gtggtggtgg tggtgatgat gatgatagta acaaatgagg 54 600 
atgataagaa tcaacagcta acatttgtag agagtttact attgtatata caaggcatgg 54660 
tactaagtac ttgcatatat ctaatttcat ccttttgact tcttttagat atggccagta 54720 
ctccacatct ttcaagtggg tgaactgagg tttcagaagt tgagtaattt cctgagagcc 54780 
tacatttagt aaactaagac aaagtcagaa tttgatacag gccatatgat gtcatcatat 54840 
atatgtaaag tgctcaactg ttggttactt gaatgaggaa ggaaagaagg aagaaagaga 54900 
aggagtgagg gagggaggaa tttacttgtt ttctaatctt aagattcctt ttctgaaaat 54960 
agagataatt ctagcattta tggaaatcaa atgagattaa aaaaagtcaa agctctttga 55020 
aaactgtcaa gtcttctcta tgtacgaggg ctaaaccatc agaggcagct tcataattac 55080 
tcattcattc attcaataga tatttggttt tctgttttag gcactagaat atagtgggca 55140 
aaacaaacta aagtgcctgc cctcagtgag ctttcagtcg agtctagtga cagacaagca 55200 
aatgccaggt gaaatatata gtcctgtggg aaaaaataaa gcatggaggg ggtgagggaa 55260 
tatgtgtgta tgttgggggt ggggtcactc tttaaatagg gtggtgaggg aaggcctctg 55320 
ataaggtgat atctgagcag ggcctgaaaa aggtaagcca gggagtcaca cagaatctgg 55380 
gaaaaaagtg ttttatatcc cacaggggag tttaaaagtc aaatgaggaa tgatatcact 55440 
gtgataagaa tgctaggaat gggattatgc aggaacccca aggcctgctg ccagtgacta 55500 
tgcagtcaag tcagtctttt ccttggtaac ctcctctggt ggggtctctg aaggctcagt 55560 
caaggtgtga tgtcccaggg gagatactat tatctgaaag catggttcct ctggcaacag 55620 
gcaatgggtc tgagggagat atcttctatt ctccattcaa ccatccatga gcatgatgac 55680 
actaggcagg tcatctgtgc tcatggcatc accccttcca gctgggaaag acaaaggaaa 5574 0 
gtcagggagg ttctccccac atcaaatgga aaagtcctgc tccaccaggg tctataatga 55800 
aatggtttat ttaagtctat aatgacctaa gataaagtta atttacttta ttcagattta 55860 
cagacattac caaattacat aggattgcaa acacaggaga agataaaata aattaccaac 55920 
cagattgaag agcttagaga aataggtaaa tagtcatggg aagagattta atttagataa 55980 
accaaaggca ggatatctgg aaaatgccaa ggtcagggaa aagccgagag gaaaagatag 56040 
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ctagtaagat ggaagagaca tggaaaggat tcagaaaatg acgttggtga tagtgagtgt 56100 
ggccagtatt atgataaaga tagaggaaat gggaaagata ttactgaaaa atatggcttc 56160 
tccttagaat cccaaaataa atgaggtaat ttgtggaaag cagtgccaga ctggattctg 56220 
tggtcttagt tccaagtctg ctaggaactg cctctgcaac ttgggaaagc tttttgattt 56280 
ttttctaacc tttagcttcc tcttctgtta tgacatacaa tacggttcta agtctaaatg 56340 
tttctagttc tcttgtgttt aaggaaggcg ccatgtgctc tgaggagacc ctgaatccat 56400 
ccctggtcct cctggctgac cttaatagaa acattgaaag atggagattc atgacaagta 56460 
gtcaaacgga aaggcattca ttttggagat ggatcaaggg tccaggctca tcggagacac 56520 
tgaaaaagtt gaaaggttgc aaaattgagg gagaacgaca gttatgatag ttcagtataa 56580 
ctgggttttg ccagaggttg tagaaaaaag ttaaagattg atttgaaatg gttgaggggt 56640 
atgtacaatg cagaaggagg ggtgagttaa ataagacaat ttataggctg gatgtcttca 56700 
gggactgggg ggaagtttgg gtatctttaa gatggcatgg cagtctgaca tacaggaata 567 60 
gttgtatcca ctgcaaatca taaccttcta tgcatttcag ggtcggctta aggatatttg 56820 
caaagaacat actggaggga gaacatcaga cttgtcctag ttgcttgggt cccgggttag 56880 
ttcaccacta cagcaatgtg ggagctaact gcatgctaag cttggcatgc agggattggc 56940 
aaaccttttc tgtaaaaacc cagatagtaa atgttttaga ttttgtgagc catacggtct 57000 
ctgtcacaat tactcaactt tgctgctgga gagtgaaggc agcaactgat tggtccctct 57060 
tataattgaa gctcctgaga tggcattctc ctgaatgata atatttaaat gaaggccata 57120 
gctgtattcc aataaaactt tatttacaaa cagatagctg aatttggccc acacaccata 57180 
gtcggccaac ccctgtttaa ggcatgatgg ccctgcctat tagcaaatat gtcctcatca 57240 
gtgtcctgaa acatgttcaa ttttatgtgt aactcctccc tattgggaac aaagtgaggg 57300 
ccaacaattc ttttccagta ggtggtccaa agaatataat tactgaggcg tgtgtcactg 57360 
gaaatgcatt ccaatataac aaacatctta ggtacttaac ttgttcattg atgtttgagg 57420 
atgaatttgt aggggaacat tagatgttcg tatgattgac ctatgttgaa aggagactta 57480 
ctaaggtaga ggcaggtttt cactttctcc aagaaggaag agttgaagaa gtcccttcaa 57540 
tctagaggaa tcatcattgt gcaaacatca gggaagagat ggagcaaagt taaatggctg 57 600 
gaagtcacag ggagctctgt tatggctagt tgtaacacag attatgcacc tggagcttta 57 660 
cattttataa aattttttgg aagtgctttg ttaccttact tatcagaaga tgctaaagtc 57720 
attcccatag gttctccatg aatgaaggaa ttcaataatt attaccgatg atgataattt 57780 
atgcttatta taatgtaatg gtcattagca cagcagcagc cattcactga ggcactgtgc 57840 
ccagggctcc acaagcatca tctcaagttg ttccaacagt catgggaagc aggtatgggg 57 900 
aacgtcttac agatgaagaa attgagagtc aggtttgata attggcccaa ggaaacacag 57960 
tttattataa gtgccaaagt caggcctcaa actcagctct gctcccacag ctgacactcc 58020 
taaccaacac attgtcctat tgcacaactg cagcattctt tcattagtgg gtatgttaat 58080 
gtgaagaacc agcacttgga ccagtgcctt ataaggatcc tttgaaacca agattttatt 58140 
gtggttgaag ggcagagagc taaatcacca aaatgtgtaa gtagagacag accagaatgt 58200 
aatctccatg aaagcaatgt ccttctctgt tttttaattt tttcaccatg ttatccctag 58260 
cacttagttg tgtctgataa atatttgaat aaatgcatca gtatgtgaac attatttgga 58320 
attatttttt ctttttaata tcaaaaatct cataaccacc taggccaagc ttaacccgtg 58380 
ggcctcaggc cgcatgcagc ccaacacggc tttgaatgag gcccaataca aatttgtaaa 58440 
gtttcttaaa acattttgag actttttgtg tgattttttt tttagctcat cagctattgt 58500 
tagtgttagt gtatgtgatg tgtggtccaa gacaatgctt cttctttcaa tgcagcccag 58560 
agaagccaaa agattggaca cccctgatct agcctgtctc aatgtctagg ctttgagaca 58620 
cccctgatct aggctataga aatgtctcct atttctctct cttttaacca catagttcaa 58680 
taccctggac acagtaggca cttaaggaat atttgctaag gatggggcat gatagctgac 58740 
ccttaataaa taaagtgcta cattgctaca gtaatcaaca accttctacc cttcctagct 58800 
cctgtgacag atgcaactac ttcttattcc ccatgtggga gttgtaactc aacacttctc 58860 
tcccctattt atggatgcag attggaaaat acattagaaa gggagaagcc aaaatatcct 58920 
ctgatttata atcagtactt tggttgaaac tttcatgact atattagtaa tagctagtta 58980 
cctgtaacac agtttagagc cattaataaa atacctcgtg gaacttttat gtggctccat 59040 
gtagcagagc tcagacctgt ggaagaacct ggagtaatta gctggaagga tgttatcaca 59100 
gttagcacca tccaaagagt gaatccactg cagagtgaac taatgagccc tggtctgaga 59160 
gacactgaat cagaggtgac caggagtcag gtatagaggg gctatactag gtataggtca 59220 
gatataatta tcactaaagc ccctttaaaa tataaacgtc ctagattttt gacatacccc 59280 
atgtctccat ttattgtgac attgtctttt ggtaaatgaa ccccttctca taaaaaaaat 59340 
ttttgttgaa atataatata catcacttgc atgaattttt actcctgctt gagcaggctg 59400 
agcctggttg gtaatgttag caggatctca gtctatggca gacatactga ctcttccttt 594 60 



81 



CL001205 



gcttttaagg tggtgtgatt tcaactaaca 
catgtcctgg aaagagagcc accttgccaa 
tcgtgagctc cgcatggtcc tgggatggcc 
aaaaatcttc tcaattgtga aacacagttc 
aaggcgaatg gtgttgtacc attgtattca 
ccctgttgtc agaggtacag cagtagttag 
ttttctctag atacagaaat atttcagtca 
actgatggtt tagccagttg ttggtcagtt 
aaaagcagta tcctttcaaa ttatatgagg 
tgtgagaaaa aggggaatag aatcaaagta 
cttattatgt tgctatatgt ccctttagac 
tttctaagac tgggattatg gtgtatgcac 
tgagcatctt tcatgttaat aaataggcat 
ttccttgatg taatatatca taatgtgttc 
ctgtgtattt taaaaacaac aaacttgaat 
tgagatattt aggtagcaaa cagctctgat 
gaaaatgagc tccccctcct ttcaaccagg 
taattgagat ggtacacatt aatgatccct 
ccaacacttt gaaatattag taatcaggct 
gctctgcaca atacattatt acttgtggat 
agaagcaggg tcttccgagg ttaacagtca 
aagccataaa tttaaactct gaaaaactca 
attggagtct ggaagaaagg tcctctagac 
ttgtgggtct gagatcagaa agtaaaattc 
gtgggcaaaa atggccaaag aatggactgt 
ctcccttttc aaaaattttc caggaaacta 
gcatctctca aagtgatcta gaaattccaa 
aggggagata gaagggtggc ttgcaaagtg 
attcccttta tgtgggacta agaaaattat 
attaaataag aagtagagag gcgttacagt 
ctatcggtat gaaattggat catctcttta 
aaatagagac aaaaatatgt tccttagaag 
atagagaaag cttaggctag gatctgtgac 
gcaggtgttt tatttttttt tcctttccag 
gtatatagat atgctaagat atgttcattt 
aaagacgtgt acacacagct gtcctagaag 
ctctccaggg gtgaatggga gaggctaaca 
agatgaaaaa aaaaattaag gcagggtagg 
cacaactggt tggatgaatt gttactatta 
ttgttactat tattgtacct attctaatag 
tatttgcaac tagaaccaaa tcaactactg 
aaggtagaaa aagcttagca tcttcatttt 
ctttacctgt tcctcagcta aaatgaggat 
tgttgtaaga attaactgaa ataatgtaca 
gaaagcacag attgccagat cccaccccca 
acccaagaat ttatattctt ttttattatg 
tacatgtgca caatgtgcag gttagttaca 
agctctcaag tgatgctgat atagctggtc 
aggtcaatgc agtttttttt aaaagctatt 
gttgattatg ttttattatg tgtctagacc 
agaagatgta atattttctg tagcattccc 
tagttaatgt atgtgttgat tttatttgtc 
catacattta tttgaatgtt gtagaatgat 
tacagttttt tttgtttgtt tgtttcttaa 
ttattttatt tcatttgaac ctttgataga 
ttttctgctc atatgaccac atttcccagt 
ttcgattaat ggaagagaac agatagctct 



gaattcatgt tgtattagac cacataagct 59520 
ccagttcata aaaattagct cagcactttc 59580 
tgaggtgggt cccagcaata tgtatttttt 59640 
tgaaatagaa cttcatcatt tggggtctgc 59700 
caagtaaact gagacccaga gactaaatgt 59760 
gggcatattt gggactaaaa ttcatattag 59820 
ggccaatttt tatacaaaac taacttcagg 59880 
atatgtgtag gtttgtgtgt tattgttata 59940 
gatacatgtt tatcatggga aactataaaa 60000 
tctatcattg cacaactctg agataaccct 60060 
ctgctattca tatgttgggt tttttattat 60120 
agtttttcat tcttcactgt tattatatta 60180 
tcacactagc tcttaaatag ctatgtcgaa 60240 
aacagagtct ttattggatg tttaggatgt 60300 
aaatactata caacttctaa agctcagaag 60360 
gaacttaaag agttaacaat cagctaacca 60420 
ttaaaacctg gttctttcaa ccaggctgga 60480 
gttctcatca ttgttcagtg ccaggcatgg 60540 
cactgagttt cttttcaggt ttgaaacaga 60600 
ccttgagact tctctgccac ctccaagagc 60660 
ccactgggtt ttaatcagga aaaagaaaga 60720 
gtacccaccc agaaacagca acaaaatcag 60780 
ttgagaaagt tcagggtgaa attgggttta 60840 
taaagagctc aaggcaacaa catccaaaaa 60900 
ggtattttct gggaaagaaa acgatttccc 60960 
attgtcaggc aacccacttg tagaatggaa 61020 
gaagattttt tttctactgt tatgatgccc 61080 
gactcaggtc tgcattccaa aactgccttc 61140 
attatccatc taggcatttg tcctccatcc 61200 
actcatcaag aattccagtt ctatcgctta 61260 
atctctttga gtctcaattt ccagtcttta 61320 
attgcattga atattaaaat acaatgagct 61380 
atggttggtg tttaataatt acaggattca 61440 
ctccaaatta ttctatgtca ggtgttaggg 61500 
cccaccaggg agtcgagtcc actgtggggt 61560 
atcaagaagt agagagctgt gtcatgaagg 61620 
gtagaggcag gcattggggt gggctttaaa 61680 
agcagaagca ctcagataag gaaaaaatgg 61740 
ttgcttgaat ggcacaactg gttggatgaa 61800 
gtacaattgt taaggtaaag gaaaaaacca 61860 
ctagaaaatt taaaatcttc cttcaatatc 61920 
gctactaaca ttctgacctt ggacaaataa 61980 
gtaaatccta acctgtctat cttaatagga 62040 
gtctgcctca caatcatctg gacggcttga 62100 
gaggttctaa ttcagtaggt ctggggttgg 62160 
atgatgatta ttatacttta agttttaggg 62220 
tatgtataca atgaatttat attcttaaca 62280 
tggggattac agtaagaaaa ccactgcttt 62340 
attcataact ggctcaagag tcatgtaagt 62400 
cttaatagac tactcaggat atgcacaaat 62460 
tacaccaaat cggccttgct cggtcatgca 62520 
ttcgtgtggc ttctagtctc actagaaaga 62580 
attattcatt ggaataaaat cggatctttt 62640 
tgagcaggcc tatgattcct aaaggacttt 62700 
aactgcatct ttcatgtgac cactctccat 62760 
agtctttaaa ggctccagtt tggatgggat 62820 
gtaaatagag ccagatgtct cttgcaatat 62880 
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ctgaaatgaa 
aggagtagtc 
attacagcca 
ctgtgtcata 
acctagcttc 
tgtggttcct 
cagagaaagc 
tcactttgga 
gatctgtagg 
ccagtgattt 
gatcaacatt 
aaaaagaaaa 
agccgaatgc 
cagcttgtgt 
ctgtttttgc 
ctgagaacaa 
gctgcatcca 
ttttttaatt 
ggaagaatac 
agattctgtg 
cccaatcata 
ttcaaagaga 
ttatttattt 
ctatgttgcc 
ccaaagtgcc 
cataaatgac 
tacattccaa 
attatttcac 
gtggtgaatc 
cgaatgagag 
tttattaaaa 
acggggagac 
ggtgtgcttc 
catatgcaca 
gttgtatcca 
agttaaactc 
gagatattgg 
ttccctggca 
catctgatgg 
ctcgtgtctg 
accctccccc 
tgatacatag 
tctttgatgg 
agtcagccag 
gtttaatttg 
ttcctttcag 
tagatccatg 
tctagttaat 
gggctcagga 
ttactctgtt 
tcccaggttc 
accagcatgc 
aggctggtca 
gaattacagg 
caagtgattt 
tcctttcttc 
ttgaaactag 



ctgtctcaac 
tttgcattat 
atcactatgc 
gccttggcac 
agttatgcca 
ttttcaaagg 
cccccctcat 
attgtctgcc 
cctgttttct 
tatgtgtagc 
acaaaaaaac 
aaaaaaagaa 
tggccacgtc 
gactccttta 
tctttttggc 
ggattttcta 
gtaccaatta 
ccccagatga 
tactcagaat 
ggggcagggt 
ttttgcatag 
tgtcaaaatg 
ctccagcttt 
ctggctggtt 
aggattacag 
agtggtgtca 
cattccttct 
cctggtgatg 
catacaggtc 
gcataagaca 
agctttagaa 
ttgagagacc 
gggggtctta 
gtggcctgtt 
tgctcacgtg 
caccattttg 
gaaaatgcga 
ctggctgcaa 
ttgcctgaca 
actagctacc 
ttccccaccc 
agaaaaatag 
gttgagttgt 
gtctaattgc 
aggtctgcaa 
gctagccact 
ttttttaagt 
gactgtccac 
agtggttttt 
gcccaggctg 
aagcgattct 
caggctaatt 
caaactgctg 
catgagccac 
tgatgcaggt 
cctgtctact 
ttaaaaccag 



catcagaact 
tcatagccct 
tttcatagcc 
tctgttcttt 
agagtgtttc 
gaaaatacta 
cctctccgaa 
agtgttttgt 
ctgcaccaga 
ataacttttt 
atcatgagac 
gaaggtttgg 
ctgtaagaac 
cattacctgc 
aagttcacag 
atgttacttc 
aataagaatc 
ttccagtata 
ttattgtgca 
ctaaggttta 
caaagcctaa 
aagctttcct 
tcttttccaa 
tcaaactctt 
gcatgagtca 
aggatattcc 
taactaaaag 
ctcatgggat 
tgcagcaacc 
gagtgaaaga 
caggaatgaa 
aagtgcatgg 
cattacttct 
agtgcttgtg 
aggcattctt 
cctcttagtg 
tcaccagttt 
ccaattatta 
tttcttgtgg 
tactgtaata 
agttattgct 
ttccagttgt 
aagttatgtg 
ctagttatat 
ttggagagtt 
tctctgagct 
atattatcaa 
gtcccactcc 
tgttgttgtt 
gaatgcagtg 
cctgcctcag 
tttatatttt 
acctcacatg 
tgcacctggc 
agtaggccag 
agtatagtga 
ggttgtgcca 



gtaaatcacc 
tttgctgtag 
catcctcctc 
gaacttcttt 
ctttctgtga 
tagtcaccct 
gcagtcacat 
ggggtgaggc 
ggtcccaaac 
aagaggaaga 
tttgaataga 
cagtactgga 
ccacactctc 
ttgactccta 
acttacagag 
acatcaaaat 
tctggggagg 
cagatcagtt 
tacaaatcac 
gcatttctta 
gagatcttct 
tgttaagtat 
acaacctttt 
ggcctcaagc 
ctgcatctag 
tgtggaaagt 
agaaagaagt 
tcccatttta 
tcaattcttg 
ctgaggcagg 
agaaagtaaa 
tttgacctct 
ccactgattc 
aggagccgca 
cccttaccag 
tgcatgcttg 
caggtttttc 
ttttagagag 
tggcagcagg 
aatggggtgc 
caagatcaca 
ctttataagt 
gcagcctctc 
gagtgtactg 
gatgctgaaa 
gaaaatgctg 
aggactatgt 
cagagattcc 
gctgctttgt 
gcgtgatctt 
cctcccaagt 
tagtagagac 
atctgcccac 
caggaagtgg 
atgcagagaa 
ccattttctc 
tacttctact 



ttcagaaggc 
tacattcttg 
cctgaacccc 
gaggagtagt 
tcatgataca 
aattccttca 
gctctattca 
tcagactctt 
tggtgttcta 
gagaagaaaa 
aatttggatt 
ctcccatctt 
cagtttgctg 
aaggtatttc 
tttgaaagct 
cacctatacc 
accaagctgt 
catctaagaa 
ctggagattt 
caagccaatg 
ggtctgtcct 
ctaagcttag 
tctgtagaaa 
aatactccca 
ccagagtttg 
agtttttcct 
gggtcttcag 
tgatggtaga 
cctcctcaaa 
tttcagagca 
gtacacttgg 
gacttggggt 
ttccattggg 
tgcacagtgt 
tattcgtaga 
agctcattca 
tatccattgg 
acagtttaat 
ggggaccctc 
agattagaaa 
taattatgaa 
ggttcaaaac 
ctttcaggtg 
agtaggtaac 
catttctcag 
tcatggtgaa 
gtatcggaat 
gagtcagtaa 
tttgtttttg 
ggctcactgc 
agctgggatt 
ggggttcccc 
ctcagcctcc 
catttttcag 
atatgatata 
ctgaatcaaa 
cagctcagaa 



tccttctccc 
tctcatgaag 
tgctttcatg 
gttttccagg 
ttgtaaagta 
agaaagaagg 
gaatgaatca 
tctatgggta 
tggattgcat 
ttgactttgc 
ttcagcaaaa 
taagggtaac 
ctgtccatac 
aattagtggc 
aaaggagtcc 
ctataaccag 
caggattttt 
ccagtacctt 
gttaaaatgc 
cttctggtcc 
ttttcataag 
gatgaattat 
cagggtcttg 
ccttagcctc 
cttttaaaag 
cttatttact 
ccaaaagaac 
tgtgttactg 
agaaagaatt 
ggagtgaatg 
aagagggtta 
tttatgtgct 
atggactgtc 
gtttactgaa 
accatatgcc 
cccagtttct 
gaaactgcct 
aatcgcctat 
tcctgtcctg 
acaggtcctt 
atagaagagc 
tctgtgagct 
agaatgaagc 
tctctcaata 
gaccagaaat 
ttcattcttc 
cccttgagac 
atctggggtg 
agatggagtt 
aacctctgcc 
acaggtgccc 
catgttggcc 
caaagtgctg 
aaaactcatc 
aaggtaaatg 
tactgcagcc 
ggaggctctc 



62940 
63000 
63060 
63120 
63180 
63240 
63300 
63360 
63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 
65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
66240 
66300 
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catttgagaa cacatgggtt ccttttgcta ccaggacatg cagcttggaa cctctgattc 66360 
tcagtgatgt aggcattttc ttagcataca gcagcctgga atttatcata atgtacatgt 66420 
cacaggagga tatgaaatag agtaaacacc ttttttatag actttagatt ttgaggtctc 66480 
actacagcag cgttttgcaa ctttttaaaa aagaaccaat aatgatctct ttggatgttc 66540 
ataaaagcct cacactctgc tgtcccatac tagattctga tatacctaac tttgaaggat 66600 
gtccaccatt gagtgtcact acatacaggg agcaaattcc atttcatttt tcgcgcattc 66660 
ctaccaacga agagaaattt gaggcacatt attttaggaa atttgtacca taaaaacaat 66720 
aggtatacat caatgtttta tcatttataa tgtttcaacc ttaatatgtt tttgacactg 66780 
ctcctgctgc actggaggaa tgttgtaata gattgttaca taaccacttc tcctggcagg 66840 
tattttcccc cttccttatt ccactaagac tcacttgcaa cgcaaagacc agggcctcat 66900 
ggaagaaggc agctgggcac aagcctgttg ccatggaaag cttagggcgg gaagcgattg 66960 
attggtctct gcatacagag actgatctag aaggcttcag tgtgtctgaa tggacctgtc 67020 
tgggctggaa tttcagtcag tctgacagac tgctaaagga gactcaggtg tacacttcag 67080 
caacttgata actcttcccc agctgaaagt cgaatcattt gatccaacgg gaaagaagct 67140 
aaaattgtcc tgacagctaa agagcgatct gacctttgtg atcaggagga gaatttcttg 67200 
tgatacagaa agtgaaagta gaaactggaa tgatggtatt gatgatgaag gcatttagag 67260 
cagcaggcac tgccattcat taaaagattg cactgtgtca ggcacatgct ttatcctatt 67320 
gaaccatcat gccaaacttg ggacacttag gtatcattgt ttttcatttt ccattaagaa 67380 
cttgaggatt aaagctgtta atctatctgt tcatggtgaa atgagtagta tgtggcctgg 67440 
gtttgaacct ttgttatagc tgcaattttt tcagagcttt taattgttga actacacggt 67500 
gtttctgggg ctattcttga tctttgcaca gtgtgggggc gaaagttggt ccttttagga 67560 
attcaactat tctctcattt agttatggcc aagaaaggag aggctcagag atcagttggt 67620 
gcctcaaaag gtcgctcttc cctgcagccg acttaatatg tctcctagct cccagtccag 67680 
tgctccagca agatccggaa gctaaatatt ccctgaagcc tttataatct attgagaata 67740 
cacaacagag ttgctgggag tttttccaag gcagtaagtc tcatttttct tattagtttg 67800 
gatcccctga acccttaagg tagtatctca aatgcagtta agtgttcaac caatatttgt 678 60 
tgcaagaata aacattacta gtaagaaatg tgatcttgga gacaatattt tatgttccta 67920 
gtctctagtt ttcctatttt tgaaataagc aggtgtgttc tagatatttc gaaaggccct 67980 
tccactttaa aacttattta actctagagc aaggatctct cacctgaagt ctgaagattg 68040 
gtctatgtac tctctgtaac cctacatgaa atgctttaaa acttcattta cagttatgtg 68100 
tccattaaca attggaatac attcagagaa atgtgtcagt ggacaatttc accattgctt 68160 
gaatatcaca gagtgttact tacacaaatc taaatggtgt agcctactac acacctaggc 68220 
tatattgtat gatacagatt atagctccta gactacaaac ctgtgcagca tgttactgta 68280 
ctgaatactg aggcaagtgt aacacaatgg taagtatttg tgtatctaaa tatatctaaa 68340 
catagtaaag gtacagttaa aatataatat aaaagatgaa aactagtaca cctgtataag 68400 
gcacttacca tgaatggagc ttgcagaact ggacgttgct ctggtgagtc agtgagtgaa 68460 
tgatgaatga atgtgaaggc ctaggacatt actatgcact actatagagt ttataaacac 68520 
tccacggtta ggctacacta aatttattta gaagtctatt tttatttcat tttgtttttg 68580 
aaatggagtc tcactcttgt cacccaggct agagtgcaat ggcaagatct tggctcactg 68640 
caacctccac ctcccaggtt caagcgattc tcctgcctca gcttcccaag tagctgggat 68700 
tacaggcacc tgcccccatg tctggctaat ttctttaaca ataaagtgaa cttggcttac 68760 
tgtaattttt tactgtacaa acttttaaat gttttaactt tttgactctt ttgtagtaac 68820 
agcttaaaac acaaacacat tgtacaacta tacaactttt ttttccttat atccctatgc 68880 
tataagcttt tctttttcta tttctaactt tttgttactt tttaaacttt ttggttagaa 68940 
aatgtttaaa gggtaaagac acaaactaga gaatttatat ctttaaactt taaatgttta 69000 
agaagtaaag atattaagac acaaacacat gcattagcct aggcctacac tgggtcagga 69060 
tcatcaacat cactgtcttt cttccacctc cacagcttgt ccggctgaaa ggtcttcagt 69120 
ggcaatagca cgcatggagc tgtcatcacc tataatactg ccttcttctg gaatacctcc 69180 
tgaaggactt gcctgaggct attttacagc tacctttttt tataagtaga aagagtgcac 69240 
tgtaaaataa caacatatat agtaaatgta taaaccagtt acataactgc ttattatcaa 69300 
gtattatgta ctgtacatga ctgtatgtgt tatactttta tatacaacca cagcacagta 69360 
ggtttgttta agccagcatc accacaaaca cattagtaat gcattgtgct acattactat 69420 
ggccaagata tcactaaaca ataggaattt ttcagcccca ttataatctt atgggaccac 69480 
cgtcatataa gcaggtcatt gttaaccaaa acatcattat gtggtgcgtt actatatttt 69540 
ggtataattt acatatagta aaattcactc cttttataca gttctatgaa ttttgacaaa 69600 
cctatatagt catttaacca ccattataat caatatataa aatattttaa ttatcccaaa 69660 
aaggtttttt aagcacattt ataggcaatt ttctcccctg caaccccagc ctctggaacc 69720 
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actgatttgt tttctgtttc tataattttt ttcttttcca gaatgtcatc tacatggaat 69780 
tctgcaggat gtagcctttg cattctggct tctttcactt agattcattt tgagatccgt 69840 
tcacattgtt gtgtgtatta gtagcttgtt cctttttatt gctgagcaaa tttttcattg 69900 
tatgtctgta ccataatttg tttattcatt ccccagctga tgaacattta gtctgtattc 69960 
aatttggggc atttataatg aagctgttat aaatattcat acatagtttt ttatgtgaac 70020 
atattgctta atttctcttg tgtaaatatt atggggagtg ggcttatgga gaattgagaa 70080 
atatacatta atgtatctat aatttttttt ctgggaagat agtacacagc ttttgtcgga 70140 
ttcccaaaac gggccagggc tcacaaaagg ttaggcccca ctgaccttgc tgcttcctct 70200 
ggtttgtgtt gggttcttta agctctgtgg tctggctttc aagtggagca tcaaactgag 70260 
aggcagcctg gcttaggaaa acaggcattg ggttaggatt gaagtgcttc ccaccttagt 70320 
ctagtctttc actgtgtgac attgccagcc ttctttccct ttccgggacc ttggaaaccc 70380 
tgtccattgg taaccagttt gatggctaag ctccattttt ccatcatgtt tcctagacaa 70440 
cgccatgcaa gcttctgctc cagccccttt ggagctctga ttcagacact aatctcaggc 70500 
cctccaagga agcatcattc agaccttccc tgcttcctgc agaggcacat gtagtacagt 70560 
acgtgaggct ttctatggag ctgctctcat ttttgttcat taacttctct ccctgggagg 70620 
aggcatgcca gggagggcgt ttatcaagat gaggcacatg acaataggag caccagattt 70680 
ccagtccctg ttttgttatc agaagttcgc tttccagact tgggcaaatc actttaatgt 70740 
ctcttgctta aaaccttgac tagcttttca atgtttttgg ggacaaggac tcaaggcctt 70800 
agagatcttg tgatctctct acatgtgtaa taaatatata aaccataata tagctgctta 70860 
ttatcaagta tcatgcactg tacatgactg tatgtgctat acttttatat gcaactggca 70920 
gcacagtagg tttctttaaa ccagcatcaa cacaaacaca ttagtaatgc attgtgttac 70980 
aacattacca tggcagagat attactagac aataggaatt tttcagctct attataatct 71040 
tatgggacca ccatcatata agcaggtcat tgttaaccaa aacatcatta catggtacat 71100 
gactatattt tggtataatt ttatacaaaa aaaaattgcc tgacttttgc ctacctaccc 71160 
agctcctact tccttgagtt cactcctttg tgcactccaa ccccactcac ctatctgtta 71220 
tccttctgac acaccctatt gcctccttcc atttggcttt atcctacaaa gtttgctgtt 71280 
tacaaagttt actctgaaat agaggtcatc attttgaaga ctagaaaacc aaggcacaga 71340 
gacttacatg ccttgcccaa gtttgcatag tttgtaaatg gaaaagctgg ggaaaatcca 71400 
cggaaacttg gctctggatt ccatgctatt gaccaactcc tgacttcctg ccctgctaga 714 60 
ccttagtggg tggtcatgaa ataaaggatg taggaacagt gtgccaagag ttctgttgga 71520 
ttattttagc tgaaatctca taatggcttt ttatagaata ttaatgactt acagagctct 71580 
ttgctctggg ttatggactt aggtttctaa attgtcatct ccttctagat aacttttcag 71640 
gaacttatca gcacccacac aggactggat gtgcttttgg aaggtaccat ggaggctgat 71700 
atggtgtggc tatatcaaat ctcatcttga attgtagctc ccataacccc cacgtgtcaa 71760 
gggagggacc aggtgggagg taattaaatc atgggtgtgt gtttttccgt gctgtcctcg 71820 
tgacagtgaa taagtctcat gagatctgat ggttttataa agggcagttc ccctgcacat 71880 
gctttcttgc ctgccaccat gtaagaagtg cctttgctct tctaccatga ttgtgaggcc 71940 
tccctagctg tgtgcaactg tgagtctact taacctcttt ttctttataa actactcagt 72000 
cttgggtgtg cttttattag cagcatgaca acagactagt ctaatacaga ggctttgggc 72060 
ataagctatg ggcctttggt tttgaggttc tccaatattc agctgggaaa ttatgtacca 72120 
ctaagatgag ttttggagaa gcctccttga gagtaattaa accagagcag caaaacaata 72180 
gactcctacc tgtatggaaa ggacagagtc acaagtcaaa gcttcaggct gaacctttct 72240 
tagcaaagtg agaagggtag atctgagaca aaggagcagg gatttttttt aggttgattc 72300 
ctgtatttat gtaacttctt tctcctcaac agtacaggtt tcttcatgat acatgtcttc 72360 
aagacagcat tggagtaaag gattctcagt ggccttgttt tattcaatgg catttagtaa 72420 
acgtggggaa gggatcaaga gtccacatca gatcctggcc ctggagcagc ttttctagtc 72480 
gggggtcatg atagctacag tctctctagt atcttctctg tgccatttaa ataaataaat 72540 
cagagaattg gactcagaat ctgattctct gattccgctg agtcagttct tggccagggt 72600 
catgaaactg gtaacttgta cagcctggac ccgtactcaa ggatgtgaga cttggagtgg 72660 
gggttgagag gaggacagtt catgacaagc ctgacccaac atagtgctgt gcccaattag 72720 
ctagctacac agataagcaa actgaggcta ccactgggtc agttcttggc aagggtcagg 72780 
aagctgtaac ttgtagagcc tggacaagtg tctgtgatgt caaaccccgt ccatgtaact 72840 
gctagttact atatgggaac tcaagtttat tacttaactg tattaagctt ccattttctc 72900 
taagaaatgg gggaaaaaaa gcttttacct tggcttaaag cagttgcaag aattaaaaag 72960 
ttaattcatg ttgtttattt atcagaatgc ctagcacatt ataagtctca gtataggaga 73020 
attgtattat tcttaccatc atcaccatca ccaccattat ttgccaagag agaggtatat 73080 
gcaaagtgct atagtggttc aaaggagggg gcaatcgctt ccaccaggaa gacagtggag 73140 
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ggtagtgtga ggataaatgt aaaacctgaa aatttaaact tgaacttgaa aggagactgt 73200 
gtatgtggga attttggatc aaaaggggat tccaatagaa gaaacagctg agttaaagca 732 60 
aagcgataga acactgataa cgactatgaa tggtgaatag agcattttgg ctgattgctg 73320 
tgtacctaaa tggagatgag ggtggaaaga tgcattgggg ggtaagactt tgagaaccaa 73380 
gttcatttga gaaccaagtt catttgagaa caactacgaa tggtgaatag agcattttgg 73440 
ctgattgctg tgtacctaaa tggaggtgag ggtggaaaga tgcattgggg gataactatg 73500 
agaaccaagt taagggaagt gtcatgagca gagctatgct ccaaaaagat ttagcaggcc 73560 
atctataccc attatgtctg gggtgcatat gagggagtta caggatgtga gacttggagt 73620 
gggggtggag ggagggcagt tcatgacaag cctgacccaa cacactgttg tgtccaatta 73680 
gcacgaccta ttttggcttt gaatggggaa ggagaatctg tgacctctga gtcagccttt 73740 
atggaatgct caagacttaa caggtcttgc aagatgcatt ctctatcagt catagtactt 73800 
taataatggt tccttactca gcttcattca ttcattggaa attcatgcat gtctttgtag 73860 
tccataggat tctcagaata accttgtgac tagacgagag aaatattgac tgtgcgatta 73920 
ctgaaggcaa agactgtttg tgatgcttga ttacttgatt tttgcatatc caacaagtga 73980 
atgagtgagt aagtgagtga gtgagtgaat gattgaatct ccacttgact cctgaggcaa 74040 
ctgagatgag gagatattaa aatatctcaa ggtgtcatta ttggacacac atatgactag 74100 
gacttttgtt actttcatac ttctccattt acaatttctt aaaagagtac tttccaaagt 74160 
atatacaatt taatgatttg aaatcctgct tgcggccagg cacggtggct catgcctgta 74220 
atcccagcac tttgggaggc tgaggcgtgt ggatcacgag atcaagagat tgagaccatc 74280 
ccggccaaca tggtgaaacc ccgtctctac taaaaataga aaaattagct gggcatggtg 74340 
gcaggtgcct gtaatcccag ctacttggga ggctgaggca gaagaatcac ttgaacctgg 74400 
gaggcagagg ttgtagtgag ccaaagactg tgccactgca ccccagattg gcgacagagc 74460 
gagactctgt ctcaaaaaaa aaaaaaaaaa ccttcttgct tttgttgctg ttttcttcct 74520 
aaaagtggag ctttgccttt tcccttttaa gtcagtcact gagacagtat ttggttttgt 74580 
gccctctgag agttattttc cagcccacga gccccacgtt ctgattctga tgtgagataa 74640 
tggagcttag ccctcagaga tatggaagac accaattgtt tcgttctcta agtgttcaga 74700 
aacaactgtt cttttctttt cttccttttt tacttcccct tacaagactt tcttttcttt 74760 
gcctcaaaga gggtaaagga tggaagatag aaggaggaac tcaggttagt ctatggctca 74820 
gaggccttga atagtttggg aaaaaaatta aacaacatga aactataact tctcattgtt 74880 
ctctgctgct tccacctccc actgacagtg tatgctttcc atcaaatatg tgtaatattg 74940 
gttgtggagc agatcaaaaa catctatttt ctctggaatg tacagtcagt caatcttgaa 75000 
gaaggagcat cgtatcaaag aggtctgttg tcaaatacct acttccccct ttctgtcctt 75060 
tctcttcccc ttcaagggaa acaaatgtct gggttgcaga aaggcccttt catctcaaat 75120 
tgttctcttt tcattagtgt cttggtttgg gcagggggta tggaggcagt ctactatgag 75180 
tagtattgtt gaggacatga aatgttaaat aataagattt tacatttgta aagtgccatg 75240 
ttccttgctt ttacagacag aaacttaggc tgctgggttt tgaacaatta agttgaggag 75300 
ctatttggga tatgttgtat aaatcataac cgcttgttga agggtgtgct gaaatcaagg 75360 
tgtgacctct gcacaattca gggacctcct cttgggcaag tctagagtcc cagccttgcc 75420 
aatgcttcag aaatgattag attcaaggtg ccaacatcag gccaactgaa cccttgagag 75480 
attgccctat aaactgttat tggagcagca tctgtttgcc agggtaaata ttcagtcata 75540 
tggactgggc cctggagtct tcagatggtg cagcccttcc cagagggaag acttcatggg 75600 
ggtccaaatg ctggtaagaa gtttttgttt gtctgtgtct gccatcacag catagagatt 75660 
aagaacctgg tatttggaat cagacagatt tttgtgtatt atttacctgt tgctgcataa 75720 
caaattgccc ccaaactcgg tgactttcta caacaatctt tgtttattat ctcttatgga 75780 
tccctgggcc aagaattcac atggagcaca atggagaagg atggaagacc tgaaaactaa 75840 
actgaaatca tctaaagact tatttactca tgtgagctgt gttggagctg tcagctgcaa 75900 
cacttacaca aggcctctgc attgggcctg aacttcctca caacatggtg actgggtctt 75960 
aaggattggc ctagataaag tcatggcaga gaccactcat ggaaagctat gtcaacaatg 76020 
gaagatgaat ggccaggatg aagacagaga gagagagagt aaagaattcc aaaggttgcc 76080 
atgtttgtct aaccctgctt cacagaacca acagacaaaa tctgtagcct cctctccttg 76140 
tcagatctgt gtattttgta tgggaatgaa tatgtcagaa agtagtagct tagtgataaa 76200 
ttatgcaaca gcctcttctt cattctaggt tcaccagtac tgccaaatta acattacctg 76260 
agcacctcct ttgcccaaga atttttacaa gtgtaatttc atttaattct cacaacaacc 76320 
caaagttata attttaattt tccccccttt ggattagaac ttagaggata tggaaacctc 76380 
tcagtcaccc atatatgtaa actactgcta aaaaggttta ttttaaagaa gtggagctgt 76440 
tatttactct catacattca cctattttag tttcagggag tacttctatg gcggacttat 76500 
gtaatacctg tgatcgtata agaaagtctt ttggttcagg tattcttatc acattatcta 76560 
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acatatctgt gagacttacc tttgatctcc catgtcgtta tacttccaaa gccaccttca 76620 
actaaaatct acactaaaca aagaaaagtg cttcaataaa tctgtgtttc taagcaattg 76680 
tcacctttcc caaaaaagta aaatagcaat ttctatagct gttttttctt ttaatggctt 76740 
tattgagatg tgtcacatac catatcattc acccatttaa agtgttcagt tcagtatttt 76800 
ttagtatatt cacagatata tacattcatc gccataatct aattttaaaa catttctctc 76860 
ttgcaaaagt agcccagtac ccattagctg tcattcccca gcaccctccc accatcaggt 76920 
tctaagcaac catgaatcta ccttctattt ctgtagattt gcctattctg accattttct 76980 
atatatgaaa tcatactatt tgtggtcctt tgtgactggc ctctttcact tagcataatg 77040 
ttttcaaggt tcattcatgt cttagcatgt atcagtgctt catcccttta tggtatatcc 77100 
ataaaatgaa atattatagg actatagtat ttcattttat ggatatacca tattttgttt 77160 
atccatttat tagttgatag acatttgagt tgtttttact ttttgactac tatgactaat 77220 
gctgctatgg aaattctttt acaagttgtc atgtggccat atgttttcat ttttcttgag 77280 
tgtgtaccta gtggtgaaat tgctgtgtta tgtagtaact ctatgtttag ccttttgagg 77340 
aactgccaaa ctgttttcca aagtgattgc accagtttac attctcacca gcaatatatg 77400 
agggtttcaa tttctccaca tccttaccaa cacttgttat tgtctgtctt tttcattata 774 60 
gtcattctgt gggtatgaag tggtatctca ttgtggtttt gatttacatt ttgtaacgac 77520 
tgatgggtgt tgaacatctt ttttatttcc tattggttat tttgtatatc ttctttggag 77580 
aaatagctct ccaattccct gccttttttt tttttttttt ttttttccag agacagagtc 77640 
tcactcttgt cacccaggct gcagtggagt ggtgcgatct tggctcactg caacctccgc 77700 
ctcctgggtt caagcgattc ccttgcctta gactcccaag tagctgggat tacaggcatc 77760 
caccatgcca agctaatttt tttgcatttt tagtagagat ggggtttcac catgttggcc 77820 
aggttggtct cgaactcctg acctcaggtg atctgcctgc ctcgacctcc caaagtgctg 77880 
ggattacaga tgtgaaccac catgcccagc ctcttgtcca ttttttaatt tggttgtttg 77940 
tcttatatta ttgaattgta agaatgtttt aacgtattct agttataagc tgcttatcag 78000 
atatacattt gcaaatattg tctttttttt tttttttttt tttgagacgg agttttgctc 78060 
ttgtcgccca ggttggagtg caatggcaca atctcggctc actgcaacct ccgcctcctg 78120 
ggttcaagcg attctcctgc ctcagcctcc tgagtagctg ggattacagg tgtgcaccac 78180 
cacgcccagc taatttttgt atttttgata gagacggggc ttcaccatat tggccaggct 78240 
ggtctcaaac tcctgacctc gggtgatcca cccgcctcgg cctcccaaag tgctgggatt 78300 
acaggcatga gccaccgtgc ctggccacaa atattgtctt ttattctatg ggttacctat 78360 
tagccttctt tatggtatca tttgcagcac agttttagat cttttttgtt gttgttgttt 78420 
agcttgttta tttgtttcat tttatgtgca gtactcactt ctgctttcat tcatttatgc 78480 
cttgagtgtc cattatgtgt caggaacttt tctagacact ggtgatatgt cagtgaatag 78540 
gatagaacaa tgtccttgct ctcctgaaac taatattcca ctcaagaaag gcagatgata 78600 
aacaaataaa tcaaagattt cagacaatga taaatatgtt taagaaaatg aagtgagata 78660 
atgaagtgga aaatttttat gaggcgtgtg tccagggtgg ggaggcagct ttaggtagat 78720 
tcacactgtt ggcatttggg gccagataat tctttgttgt gttgtagctg tcctgtgcaa 78780 
atatcctaca atgcaaggac agctgcctgt gtaaggatat ctgtagcagg atctctgacc 78840 
tctacccact aggtaccatt agtacctccc tcacttgcta taatcgtaca aaatatctac 78900 
agacactgcc aaatgttcct cagagagaaa aatcactctc aattgagaac cactcagatt 78960 
aatagttgag aaaggcatct ccgagtgatg acctgaagac agagtccacc ttatgggaag 79020 
cagtggagca aaagcattca agatagatgg taaataagtg cagaagcaag gtggggatgc 79080 
acttgatgtg tttgagaata gaaacaaggc tttagagtga cagtgtggtg aataagggaa 7 9140 
agaatagtag gagataaaaa tagagaggta ggctggggtc agattatgaa ggatctagta 7 9200 
gacagtaata agaaatttgg aattattccg agtacagtat aattggaagc cagtaagtca 79260 
gggataaatg ggtgctctca aatataggaa ataagctgta gatttggggt tttgtcccac 7 9320 
tttattttat tttgagtttt tgtgggtttt tgttgttgtt tttgctcaag gatccacaat 79380 
cagagagtga tagagctagg actaaaaccc aagtctccag gctagtaagc tataggtctt 79440 
tctattaaac acgggtgacg aagcagtcta catttattaa ctgtaggatt gcaattggcc 7 9500 
gttactctaa ctgacctttc cctgaatcca ccttgtttaa gaattcctaa tatagtactt 79560 
agcatttggc atgaaaatca aagaagatga actttagcaa gaagctaatc caaatctgct 79620 
tcacagacca gcagcatcag catcacctgg aaacttgtta gaaatgcaga atctcaggcc 79680 
cacctcaaac ctactgaatt aaaatctgtg ccttgatagt ccctagttaa ttagtatgca 79740 
tgttaaagct tgagaagcac tgatctagtt tatcaaaata tcatcagaaa cagtcgagtg 79800 
attcaaatgg ttttttttaa tgggctttta ctgagtcaga agactaatga tggtgtcatc 79860 
attaagctct cattataaca tcttttttat tcctttcttt ctttggctca cagcaattct 79920 
tcactcacaa agtcctatgt agtcttcctt aataacagca tacattcttt cctttcattt 79980 
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ctgcagccac cacctttgtc tttaccctca 
tcaaaattta tgccttcctc ccttcatgcc 
acttcatatg gctttttctt gtcttcagag 
aggccttcta tatcaaaact ccaaacacgt 
taagtccatg tctttgttca caccctggaa 
aggctctgct gaaatcccac ttccatcagt 
gttctacggc acttgttgcc tgtaccaatt 
taatttcagg cacaagcaca ggtcttcata 
cacatctttg tctgctccca ctctaatgct 
cagtaaatat tttgcctttg aatctgaagt 
tcgtttctac taatttattt tctctaaggg 
agcattgttt ataaagccag ccatgggtaa 
tggatgcagt ccagaatttc aggccaaata 
gtattctgtg ggaccaagct ctgttccggg 
ctgagcagct gtatggggca aaaagccccg 
gcccagatat aaaatgataa taacattgac 
tttgcttagt acttctttct taaggttgtt 
gtgtgttgta ggtttttgta tatggtttgg 
gattcctgca gataggcata actacttaca 
gacctcataa acctaaagag gtgcagagga 
ggaagcagag gcccagaaaa gtgatctaac 
ctgcaaccag agctaaggtt gaggctttat 
tcagtgcagc cttaccttac taattattaa 
ggtcttttcc tgttgttcac actcttaccc 
cactgattca tgaaaaggca aatattttct 
cgagagcaat attctctatc cccgagatgc 
caaagttatc cctttggcaa tggttcttca 
aacctattta gatgctaatt ggctttaaca 
ctacaagaag tctctcaaac ctggatcata 
acttcaaaat ggctcaatcc atttgagttt 
tctgtgctaa ggatagtgtc ttaaaggcta 
gaggtttcag aatagacaat tttttttcct 
tattctagga agttcatggc ataatggtaa 
ttaacttagt agctttgtga cctctggcgc 
ctgtaaaatg gaagtcataa tggtacttac 
aaatttaaaa tatttagctc aatgcctggc 
ctagtattac tattatttat taataattcc 
gttcactata gatattcaga atgtccttct 
ttactaatat caaaagtcca tttgtatctg 
gcagggaacc agtcctcaca ctacatggtt 
ctatgttttg cacagggtta gtctttgttt 
tcatcagtat ccttggcctc tatccattat 
agttctgatg accaaaacta tatccagata 
gagagtcact aattacacag tacatttgtc 
gccagtgaca ggcaggcaga caggagtggg 
caaatatttt aggcactgtc tgtcccaact 
ccaaagacaa cacataaatg aacatgttcc 
aatttcatat aattttcatg tgtcactaag 
taaaaatgtt aaaactatac atgagtcgta 
atgagccata gttttctgac ctttggacta 
atccctaaga taaacatggg ataaatatga 
taaattttag agaaagcata ttattttatt 
acccaagcca aaatgaattt tagagataaa 
ccatgggctg ttgacccatg cctcaggtcc 
cccttgcttt taggctacct tggtaggaaa 
agaaactaag tcatataaga gccttttaga 
tccaataagg cacctcaaaa tattctaagt 



ctaatttgta cctagcttat tatagccata 80040 
cttcattgcc tagagctctc cgaatctcaa 80100 
taaaattcac atttccaact ctgactttga 80160 
ttcccagaga atctgttgtg tgtatgtctt 80220 
tgccctttcc catcctttct actgccttca 80280 
gatgtcaaat ctacccagtt gcctacccag 80340 
attttttggt gggggggtct gatagcatta 80400 
actaaattgt agttcatgga gaacagaggc 80460 
tagtaaggca ctaagcatgt agtaggtaat 80520 
tgtggagtaa tgcccagatt tacaagcact 80580 
tctatgctac agttcagcaa ttgtgtaaac 80640 
gcagtctcag atcagctggg tggtaggcaa 80100 
aaattgtgaa tctttatttc aagaaccact 80760 
tgtcagaaca ataggtatcc ctccctgctg 80820 
attgagcctg gcccgcctga acctgggata 80880 
ctgaattctg tattttgtta catttttgtg 80940 
ggcaagagca tctgctggta acttggctca 81000 
gtttcctatt cctgggcatt gaactattaa 81060 
gccttttctg tgtatatatt agctcttacg 81120 
aaagtgccat cattctcatt ttacaactgg 81180 
tttcccaagt tcacaaatgt gttattagag 81240 
gaaccttagg ttactattcg gttaataggc 81300 
aaatagacaa cattaggatt taaaccagat 81360 
actgtcagca atggcccatc agataatcca 81420 
gtttctgttc taagctaggc ttgccaaaac 81480 
attgcttcaa gcacctatct actttctctg 81540 
attctgatta tatctgggag agaaataata 81600 
tctcagtgag tcaacaggct catgtatgtc 81660 
cagctatcaa acagtgttgc tcagccacca 81720 
acaaagcatt ccattttata gcaaaggaca 81780 
ggcttttgag gtggcaatta ttttcttgct 81840 
acattaaaac agagaaatga agagtctgag 81900 
gcagataaat ttctgatatc cttctgaaaa 81960 
cttatcttct ccaagcctcg gttttctcat 82020 
aaggccaaaa tgatgactaa atgagaggat 82080 
acatagtaag tgcttcataa caattgagta 82140 
tatagtcaaa tagaataagg agtccccctt 82200 
ctcaggcctg aatgagtaat tccttctgga 82260 
attgagatga ggaaagaaat gcaggatgga 82320 
tactccagcg attctcaacg ttagcactac 82380 
tggggggctg tcctgcacaa tggaggatgt 82440 
aagatgccag cagcaaacca acctcctcca 82500 
tttgccgact gggggcactt gcccccagtt 82560 
catgatgggg acatgtaggc tcacggggta 82620 
gaattttttt tctgtaaagg ccccaagtag 82680 
actaaactct gccatgtagc ttgaaagcag 82740 
agcaaaacta tttatgggcc ctgaaatttg 82800 
taatcttttt aaaattttta ttcaaccacc 82860 
caaaatcagg tggtgggcta gacttggccc 82920 
agaaatacat gaagccgtaa cttcatattt 82980 
tacttctcct aacaccatca aatttacata 83040 
aaaatgagca ttgctatatt ttggaatagg 83100 
acattatatt ccaaagttat atcttgcttc 83160 
ccagaggcct tcatttcctg ccttcatttt 83220 
acttaagaag ttgccatgcc attttaagtg 83280 
cttcccataa ctggttaccc ttagatagtg 83340 
gctctagtgg ttgtctaata tatttcatca 83400 
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gagaatggtt gtgtaatata cgattgttta atccaaggtg gctcctataa atcctttttg 834 60 
aaatgttcaa gtatctctgt cacacacttg gggtatgagg tttattcact tccattgtca 83520 
actaagcagc atgtgtaccc cacagcttga tgagtaccca gaaaatcagc aaatggtgtg 83580 
ggccctactc tcaggaagct ggaatctgat ggtgatatta ggacctgctc atatgagatg 83640 
acccgcaaag agtacaccat caagaaaggt gttgtggcag aaggagaagt ggagtggcaa 83700 
gcgggaggca cattttctag cacctcctgt gcattcatcc tgaaggagag agttacttta 83760 
cctttagata agtttttctc ttctgcccag agtgcttttt tgagacagat tctgagacat 83820 
catccaagtt cagctgggaa gaatgccaca attgattaga gctgtcagcc gtggatggga 83880 
gatcagggat tggctggcaa aagtgccagg gtgttgacag gcatagtctc agtcatctcc 83940 
acagttcttc ctggctggaa atgcctctgg ctatattacc cttattgcat gatttcatgg 84000 
tactgatcag atccaaagaa ggaggatgat tgtgagatgc aaagctcagg ggacgactac 84060 
gcggtgaagg tgagagaatg cccagagctt tgaagaggct gcttcttctt caaagctgtg 84120 
ttccagcaga gcacattttg tgcatacttg aattttggtt ggctctaggc tgagcgcatt 84180 
ctattgtggc gttggcatgg gaagagggag gcatgcgaga aatattgcat ctctgctatt 84240 
ctcctgggcc cttgtgaatg ggaatgtctt cttaatgagc ctatcccaga gagtctggcc 84300 
tttcagataa acaaataagc atcagcagga gatgctcaca aaaggactgg gcaaaatagg 84360 
gccacgttga gtgacaaccc aaataaagga gtcctaagag gtctgaagct ggtggctaat 84420 
gagcacatct cctcatttgt gatttcttgg ccaggacatg tattgttgag ggtgccagga 84480 
aagtctgtct tattttatac aggaagaagc ctttctggcc tttgtcctgg tactggcttt 84540 
tatattttat gttgggctag gggcctgcct ggaggacatt tacaaggtga ggagaaatct 84600 
caagctgatt ttaacctaac acaattttct tatctttcta gtgatatata tcagttttct 84660 
cacttgggaa atgggaataa aaatacaact cacctactag aattgttggg atgattaagt 84 720 
cggatgattc tcagaaagca cttcccacag tttccaatat gtagtcagca tctgatacat 84180 
gttggctgca atcattatta gtaacagtat tatcatcatc attttcatat gtggaaagat 84840 
accaaaagga gtgtccaggg gtgagggatc tagttccaat tctgctacca gctggctgtg 84 900 
agattttagg caagtcatct ctcatgtcaa ctgagagtgc tggacgggag tttttattca 84960 
ttttgtaaac atatttggag gacctactaa gtgccaggct ctatttccag ggagcaaata 85020 
tataaaggtg aataagacac attttctgta ctcaagagat cccaaaggta atggaggaga 85080 
caggtaacac atgtgaccac atagcatgag aattgctctt actgtgtggc tgcatgccaa 85140 
atgctcgtga ggcctggcac ggaattagca agactaaagg ctctgtccca gtccaacctt 85200 
ctgtgagtct ataactgaag catccaaagt ataagctggg tgattagggg aagaagaagt 85260 
cacctagctg gcaaggggcc taaaactcaa gtcccctcct tctaggacca gagtactcac 85320 
caactaccaa gctgccttaa aataaaaaca ttcaagaaca agtgatgtgc agagaggcaa 85380 
cacagtgata agtaactggg gtctggaaat gggagacttc aacttcaatt tcagcactgt 85440 
tattttggta gtttaatgat ttagagtcag ttgtttaatc tcttagagac tcagttactt 85500 
catcagcaag ggtgttacta tagtaatacc tgctagacat tattattata tggagagaga 85560 
gagcttggta agctgtgaca tatacattct agttggtggt tgtttctgtt gttattatta 85620 
atagtaacaa caataaatat cctgttgttt atttttttga agtgttcagt ggctggtaac 85680 
tgaattagag tgcctggaga tataggtaag ggttttttgt tttaataatc attatctttg 85740 
ttcaagagaa taggaaaaat ataagcctct ttttaacttt aatgcaaaaa agtctttata 85800 
gcttgtgtgt gaaaaacaag aggaagggag gaaggggagg aaagagattg tctatttcta 85860 
cttaaagcta attagagaag taattttagg taagagtatt atgacctgat ttctgattaa 85920 
tttaagtatt attatactgt tgtaatagaa atccctgaag tgctttattg attatctgtt 85980 
ggtagaatga aagggtaagc agctgtagta aataaaattt ttgtctgagg caaagcaaag 86040 
gagaagtaat ataggagaaa ttattataat aaagatggga gtgagagaag aaacctgggt 86100 
tctagttctt tcagataggt gaccttgaac aaatctcttc cctatacttg gcctcggttt 86160 
ccccatcgga acaacctagg gcatactcta gataatccct aaggtccctt tcagccttga 86220 
caaactatga gatttctacc tgtacaatac tgaaacagtg attaagaggg agagacaccc 86280 
acaaagaaag aaatagtttg tgctgtgctc aaatgcctga ctgataaaaa agagaacaga 86340 
ctcataaaac ccactaattc aaattaaatg gataattctt gttgattttg agctggcaat 86400 
ctcaggcttc taaatcacaa ggaaactcca tttgatcctg caatggtctc ttggtctctt 86460 
tcttagctca gtagtgctaa gcacagaggt ttcatggaaa tcaagagaga aagagagaga 86520 
gagagagaga gagagagaga gctgtttaaa aaaagaagaa caatatactt ggatgaacta 86580 
aagagtggtc ctcatgaaag cctcataatg tccattgcct acctgtttaa ccctttgtgg 86640 
cctgattttt ggtcctcaac tgaaatcaaa ttaggtagtc ttatgtaggg aaatgaataa 86100 
cttgtacact gttgttgttt gccatcttta ggaccaccaa agggtaacga attcagcaga 867 60 
cccacaatcc ggcacttcag gtttttgaaa tttactcatg aaaatttagg aagtcctaaa 86820 



89 



CL001205 



ggctgtgaaa aatcattgaa cttcattggc 
ggctacaaat cagagattga aacctaaaga 
ttcccaggca gaactgggac tcgaagtcac 
ccagtataac caagtctcct gtggcctttc 
ttccccaaaa gagttgtagg tgcactctgt 
ctgaggttag aattctcctt ggatgtaagg 
aagggagtga tggatttggg atgctttgtg 
aaccctagga catggaaaag aattacattc 
aaattaggac aacatattga ttaagataca 
tgaataccta ctttttattt ctacttctac 
atttaacttc tgtgagcctc agtttctcca 
ctgtagaaca gcttaacttg aactaaatga 
tatgaatttc tgaggttagt ttatacatgt 
aagtatccta aggagattag gaaatgtttc 
gaagaatagg atgtattttg aacactggga 
aatagcctgc actatccaga gacagtttct 
cagtttaaga aatattccaa tctttgatac 
atttcatttc ctgaattgtt tccctgcgga 
atcacagata cctcttttct tctcttttgt 
aaaaaattaa aggttatctc cagataattc 
attatgttcc cttccttctt gctcttcact 
tctatgtgtt ttggacttta cagtgtgtga 
tgagctgatg caataattta ggttataatt 
agctttagag atgttgcctg acacgtagca 
gatggatgga tggatggatg gatggatgga 
cttgtgtgtt ggtgcccgcc tcgccgtttg 
ttcgttcttc gttngtttcg tcctctcttc 
cgtccgttgt ctctttctcg cgtctcgctc 
ccttgtgcgg taccgtctct gttccttgcc 
cgccgtcctt cctagccatc gatatcgttc 
ctcgtgtcgt cgcgcatcag atctcgtgcg 
ctcacgcgat tgcgttgcgc tcgctttccc 
ccccatgcct ccccccagat aaatcannnn 
nnnnnnnnnn nnnnnntaac tccagcttca 
taccctgcaa tcatttatct ctgggatgcc 
agatggtatg ttgtaatttt gtgatataga 
ctcaccccta tggctgcttg tataagatac 
tgtttggagt tcctccttta gttgagaaaa 
tctggatttg agagggatca atgcttggtt 
accagaaaac tagaagtcag aagacctaag 
taacttattt attcattgat tatgtattga 
agtgagaaca cagtagtgag taagggggac 
agtggggtaa gagagaaaaa ttaaataaat 
aagattcccc aggcacagtg gctcgcgcct 
ggcagatcac ctaaggtcag gagttcgaga 
tctactaaaa atacaacaaa ttagctgagc 
tcgggaggct gaggacatga aaatcgcttg 
agatcacacc attgcgcccc agcctgagca 
aaaaaaaaaa aatgaaagta aagaagaaag 
ttctctgaag aaaaaagcaa tacactgaga 
gtgggtcagt gaatgccctt actgagtaaa 
gatggtattc atgctacaaa acaatttgga 
gcaatgattt gaaaaatgga aggacaacca 
gagtgataga agattaaatc aggaaaacag 
ggccatgaaa agaagagtga cttttatttt 
agtagaaagg gatgtggtct catctgtttt 
ggctcatgcc tgtaatccca gcactttggg 



aggcaggacc ttgctttcaa aactctatga 86880 
gggaacaggg ctttcccaga gtcacctgcc 86940 
agctagacta gaactccact gattccaaat 87000 
ccatgatctg gacccaaaga ctttctgatg 87060 
tctctggttc tttgcagtgt aatggcagaa 87120 
atcaccagcc caccaggctg gcagttagac 87180 
tgaccttgta tacaacaggt tcttagacac 87240 
atcctgagcc ccagctactg ttaccaattc 87300 
ggtagttatg ctaccctaag aactggaatc 87360 
cgcttgttag taacctattc ttgatccacc 87420 
tctttaagac agtgatgata atgacagtat 87480 
gaaaaactaa acaaaacaca tagaattaag 87540 
tttaagagct tagagaaaag gaaaaaacat 87600 
cttggggagg ttggaaagca gctaagcctt 87660 
gaagagatga gaaggtagtg aagtggaggc 87720 
gattgtccta atgctatggg ctctctactt 87780 
cttcattcaa cagtcctttc tatttttcta 87840 
ggagtgggcc tacttgttga aataaatcac 87900 
taaaagttct gttaaaatat atatattgtt 87960 
cccaaatgag taggcttagg ctttaatcat 88020 
tgccaagtta tactaatgat cataatctct 88080 
aggttttatt gccactgttg gccctctcag 88140 
gttggactct ttatacgaat gagttgctag 88200 
ggctctcaat aaatattgga tggatggatg 88260 
tggatggatg gtcctctccc ccctcgcttt 88320 
tgcggtgtcc gttggttccg gctcctctct 88380 
ttcgtgcgtg ccttctcttc cctcgcctct 88440 
ttctcttcgt cgcatctggg ttcgctgtcc 88500 
tctttcgttt gtctcgagcg tcnctcattg 88560 
gtctggttct cgcgtcctct cgtgtcctct 88620 
cgccccctcg ccgcttttct ctcttgtctg 88680 
gcccgttgcc gttcgcgctc cgtcctcttg 88740 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 88800 
gcagagatcc caaggcacag aaagcaggca 888 60 
ctggagcctg tcaaatggag atagggacca 88920 
aaatctggtc aatttccatg aagcaaggcc 88980 
tgtccccttt tgtaccctgt gtagataatg 89040 
atagaatcct tgctcacttc catcccaaag 89100 
tgcagggaga atatgcagaa caaaggaagg 89160 
ttcaagaccc aagttcaaaa cttataaact 89220 
ttccttactc tgaggcaggc tttgccagac 89280 
aagacacctg acttcatgga ccttagttca 89340 
caactgaata agcaagataa ttgtaagata 89400 
gtaatcccag cacttcggaa ggccaaggca 894 60 
ctagcctggc caacatgaca aaacccagtc 89520 
atcatggtgc gtgcctgtaa ttctagctac 89580 
aacctaggag gcagaggttg cagtgagcca 89640 
gcagagagag actctgtcaa aaaaaaaaaa 89700 
aaagaaaaga aaaggagaaa gaaaacacga 89760 
gaaggaaaca tgaaggtatt cgtttaaata 89820 
agtgacattt aagcctagat ttcaatgctg 89880 
gaatattcta gccaaaaggg aataatttgt 89940 
gggtagctgg aatgtttatg gcaagggaga 90000 
agaaagtaac agatgaggca gtgcttgaca 90060 
aagtgcatga gaagccactg gaaggtcaca 90120 
aagaatttgg tccagtcggc tggccgcggt 90180 
agcccaaggc gggtggatca cgaggtcagg 90240 
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aattcgagac cagcctggcc agcatggtga aaccccgtct ttactaaaaa tacaaaaaat 90300 
tagctgggca tggtggcgtg cacctgtagt cctagctact tgggaggctg aggcaggaga 90360 
attgcttgaa cccgccaggt ggagattgca gtgagccgag atagcaccat tgcactccag 90420 
cctgggtgac agagggagac tccatctcaa aaataaataa ataaataaat aaaagaattt 90480 
cgcccagtca ctgaacttct ttgtgactcc cttttctctc ctgaaaacaa ggattaaact 90540 
actgattgtc cttccaagag gttgtctgag tgtcagatca tgttcattat aatactttgc 90600 
tccaagttgt gcttgataaa ttttgaactg aattgtttca taacaatatg aaaatgcttt 90660 
ataaactgta aagggccatg caagtataaa gtattaccaa taaccatcat ttccccaaag 90720 
ctctgactcc tttccaggag tttgtgtgga acaggtaaga taactggcca taaaatgctt 90780 
tgaaaaccct atgctaataa aagatacttt atatttatgt tcatttgttt attgaagttt 90840 
gcatttggtt cctttattcc atagcaccaa tgtattctct atttttgggc taggaaacaa 90900 
caagcagact caagtgtgtt atttaacctc aagaaataaa aatatccaga gataatatga 90960 
tgagtgaggc tgagagtcaa gtcagcaagt ctttggttag aactgggcac ttcttcactc 91020 
tcttacctga gtagatggga gcttgtcctc attagtcata ggccccccgt tacttataaa 91080 
gaagttgtga aaaataatac tgacaatgat gataatttga atgacacatg tttagtaatg 91140 
catatagata tatactcatg ggtatgtata tacagtccct tgctactttt ctcaaagcat 91200 
gtttttctca tctattgttt taatttagct cacactcaca tgcagaggta gagtaggagt 91260 
attccaattg gagaaatggc catgaagtgc ttaagtggtt tgcttcaagg tcacactgtg 91320 
agtgatagag gagttgaaac caagtcacag agctccctag tcctgtgcca gcactctgta 91380 
tcaagacccc tcagctcctg cggggccttg tgaaaaaaca aaaacaaaca aacaaacaaa 91440 
caaaaaaaaa caggatcctg agcctcacat taaacaatta ggaactggag cattttcaca 91500 
ctttagaaaa tatttgatcc atgtagccta gccctcttcc ctcatgtcac atacagggaa 91560 
acagaggacc acaagtagga gcccacttgg gcaggtgagt tttctgccaa ctgctcaagc 91620 
acctactgat gcttcagctc tctgtctttt aacctcatat gtagctggtg actaaggtgg 91680 
atggagataa tgcagtagaa gtcaaggtgg tcccctctga ggaccatctt accagagagg 91740 
gaatcacagt aggaagagaa ctcttcagtt aattgataat tttttctttg caatggaagg 91800 
gaggtctctg tgaccttggg tctgacctcg agcctgacat gcctttttcc ctagccctct 918 60 
cttgctctcc cttcagggat ttgggcaagg taaatcttca agttcattta ccttctttaa 91920 
aattgggcag ctttacacat tatcttggcc aatttgagcc taaaaagtga tgtagctcag 91980 
aagcagtcaa gtttagttgg atgaacatga gtattaggct gggaagctca tgcccaagtg 92040 
tgaaccttgt agccttaatc aagtgtctgc cttagagcta agcttctctc ttggtgaaat 92100 
ggtttaaaat acccatccag tacatgtcct aaaatggaca tgaggttcct ctgggataaa 92160 
ggatgagaag atactttgta ttttgtaaca ttctactcag atgcaggatg ttcatctcat 92220 
taatattaca ctctggtgcc agtttctggg gggtcagctg cagagaagct agttcaggct 92280 
gcaacatctc attagaagct gaaatttgga ttcaggaaga ggagcatgga gtgggtgaga 9234 0 
gccactgtct gctgctcaag ttcctgctgc tatctgtatt cagagaggca tcctggtacc 924 00 
ttagcagtgc tgctcagtgg gacgtctctg actttgatgt tgcacgtgtc aacattctaa 924 60 
tagggcagca gccgagatgg ggtatgagtt tggaaagaca tcttatgaca gctttttcct 92520 
aaagatgttt caagagataa ccctttagaa ataacgaagg tagtttatgc atttcatagg 92580 
aaacccagcg gcttacgtag cagctcactt cgaagtctgg catgcggacg gcttcctact 92640 
gtcccgtttt tgccctccct gcacatctgt gagaaatcat cacttgtcac agtaacacag 92700 
tattgtattt agttacattg acagcaatgg tgaaaggact ataagcctat ttccttctaa 927 60 
actgttcaca tcaaatgaac cttaaacatt tactcctctc ttaaaatgta aatcagatca 92820 
cctcagccta ctgtttaaac attctatcat ggctttccat catgaataga atccaaatgc 92880 
ctaacacaag ctggatggta gcctgatctg acctttgctg acctctctga cctcccctca 92940 
cacacagctc cccttgctca ctgtgcttga gccacactgg tcttcatttg tgctcaaata 93000 
tgccaagttt tctcctacct taggagctac tattaaaata cctaactaaa gctctaaagc 93060 
tcttcatgcc tgcaccactc ttttctaaga tatttcctgc ctgcctcctt cacatcgagt 93120 
ctcagcccag ctcagccatc tcttcagtca gtacttcgct gaccttcccc tttggcccca 93180 
ctcctacccc tctgtacatc tccattacat tgcctttggt aatgtcttct tagcacatat 93240 
cgatttgttt acctattaat tccatgccat gagaggagag attctttctt ttctcttttt 93300 
tttttttttt agaagggtgt tgttctgtca cccaggatgg tgtgcagtgg catgatctcc 93360 
gctcactgca atctctgctc cctgcctcca aggttcaacc tattctcctg cctcagcctc 93420 
ccaagtagct gagactacag gcacatgtca ccacgcccgg ctaatttttg tattttaagt 93480 
agagatgggg tttcaccatg ttggccaggc tggtctcaaa ctcctggtct catgtgatct 93540 
acgtgcctca acctcccaaa gtgctgggat cacaggcgta agccaccaca ctcagcctga 93600 
gatcctttct ttcttatggt ctcctgaatc tctagcatgg aagaataaag gtgccgtagc 93660 
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acagaataga tctattatcc aatgaatatt tgttggagga atgaatcaat cctctctttt 93720 
tacagaggga gaagttagga ccagagaaga gtagacatca ttgtgtttct gaaggtttta 93780 
tgttagcacc tttgggccat gctgggacct tggcaaatta tagcattcaa gggttttagg 9384 0 
gtaatgttca tggatcccta gaaagtcagc aaatggggat cagggtgtct tgagccctgt 93900 
gaaagtgttg gcatatttct ggggagaaga ttcatagctt ttattagtac ttcaaataga 93960 
tatttgatcc attggtagca cccatcaatt tttcctttta attagcttag tttggggaat 94020 
gtggagcgag cctaaaagag gatagatgtc tcagcaccac cttgaagacc tttcttcttt 94080 
ttcactgaca gcatggttcc atcgttctgc ccctgaatac ggggacagct cctctttttc 94140 
tattaccaga gcatatcttt cttaaattca ctatttccat gatgaatcta cagtgccaga 94200 
agctagagtc tggaatcaaa gaattccaga gttgggagag cacttaattc agtgttcata 94260 
aaccaaagtg tacagcagaa ttttctgggg agctttttac aactatgcgt ttcccactcc 94320 
ttcccctgac ttttatttgt ttgtttattt ttgttagaga taagatctta ctctgtcacc 94380 
caggctggag ttcagtggca cgatcacagc tcactatagc cttgaactcc tgggcacagc 94 4 40 
aatcatccac ctgagcctcc caagtaacca ggactacagg gggtgtgcca ccatgcccag 94500 
ctaatttttg tatttttggt agagacgggg tttcatcatg ttgcccagcc tcatctcaaa 94560 
ctcctgacct caagcaattc tcccacctca gcctcccaaa gtgctgggat tacaggcatg 94 620 
agccaccaca cttagctaat tgtagtattt ttggtacaga tggggcttcg ccatgttgcc 94680 
cagactcatt ttgaactcct gggctcaagt gatccgtctg cctcagcctc ccaaagtgct 94740 
gggactgcaa gtgtgagcct ctgcacccgg gtcccctgac ttttagaagt ggagtatatt 94800 
aggttcctag tgtttctgta acaaaattcc acaagcagga tggcttaaaa caacagaaac 94860 
tcatggtctc atagttctga aggatagaag tctgaaatga aagtgttgct agggctgtgc 94 920 
tctctctaaa acctgtaagt gagaatcctt ccttgactct tcctagcttc tggtggttgc 94980 
catcaatcct gggtgatcct tgacttgaaa ctgtcaaaga actcctgcac tttaatttcc 95040 
tcctctgtca ccacatggct ctttcccctc gtgtgtctgt gtgtcttttc ttcttataag 95100 
gacaccaatc aaattgtatt aaggtccacc ctactccagt atgaccacat cctaactttt 95160 
tatgtctaca atgaccctat ttccaaataa ggtcacattc tgaggtactg taagttgaaa 95220 
ctttgttttc aggagatacc caaaacatga catgacatgg ggtatgggaa tctgtatttt 95280 
ttaaaaaaac tccaccagtg attgtgttga gacacattgc tcttatacac atgcccttca 95340 
ctgaaaaggt gagagaaact gagggctggg aaagggacaa gctcagagtc actcagagtc 95400 
tcaatgaccc agactggggc accaagtatc atgattcccc caacattctt tctaatgtgc 95460 
caagctgcct tttggagtct ggacctggtg atgctctcac tggtctccaa ggaccagcat 95520 
ttaagtgaat atttccagtg gtccatgtct gccccaaggg gctacttact actccagtgt 95580 
aactagaggg gcattagtgg atcttttggt caaatcaact gcctccatcc caaaggatat 95640 
ggttttgggg caagttggac tactccagca gtcatggaca ggcaccccag cttcctccaa 95700 
gttgtactca tgttgcccca aattactgtg gcggatggct ggtactatga tgaaattctg 95760 
tgttatatgt gtgagaagac ttccttcctc atccatcact tacccagaac actaacatgt 95820 
ggtatatatt aacacgatcc tctcttgggt tagcatgtgg gtgggttctt aggaaaataa 95880 
attaacatct ggaaagacaa cccaagcctt gcccacagtg tttgagatgt tactcagtgg 95940 
aaatattggt tgtcgtgccc aatttgtgtc ccctcccatt gtgtttttat gctggaagtc 96000 
tgcacaatct gttgatcccc gtctcaactg gctccctcac agcagatgca tttaaagtca 96060 
cagatctcaa agtaagcaaa tctaaagtgt ttaatctatt attaatcagc tagttgccca 96120 
aatgatagag ctctagggga ccctgaggaa gacatcaata tttatggcct ccttagggag 96180 
tgcatatttt tctctctctc ctttattccc acctttcttg ccttcagatg agaatggaaa 96240 
tatgaagcct gttatgtttt atgccattgt acaaggcagg aatgtcatgt tttgaaatga 96300 
gatgactaca atgtggtatt ttatgtttgt ttgagaatat tagctgttag ggctgtggta 96360 
acctctacca gattgggtta aacaaaagca gtctgcagtc catgataata caatggaaaa 96420 
gcatagggaa cacaagaaac ctgggtgaac tttttcacaa acaattcatt tttctaatga 96480 
ggcaactaaa gctcagaaaa ggaaagggtc ttccttgtga tcatacattc attttatcat 96540 
tcaacaaacc ttaactgaat acctcctatt caacagtgca cgagctgcca ggaacaatga 96600 
agtaagtatt aacagaaatg cctaacattt actgagcact tcttaggtgc tgggtacaca 96660 
ctaagtcctt caagtagtct ctcctgctga aagtgctatt ccaagtgcaa gttggaacag 96720 
caggtgcctt ggcaccaaag ctctatgtga ccgacctttc tgacctggcc tcacttcctt 96780 
ccccctcccc tcatttctct ctcttctctc cctttatttt tcttttcttt ctctctctct 96840 
tttttttttt tttttttttt tttttgagac agactcttgc tctgtcgccc aggctggagt 96900 
acactgtcgc gatctcagct cactgcaacc tccacctccc gggtccaagc aattttcctg 96960 
tctcaccctc ccaagtagct gggactacag gcacccacaa ccatgcccag ctaatttttg 97020 
tgtttttagt agagacaggg tttcaccata ttggtcaggc tggtcgtgaa atcctgacct 97080 
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caggtgatcc 
cccggcccct 
tggaatataa 
agattcctct 
ttttcagcaa 
tcgctatcat 
tgatgtgctg 
gtggtgagga 
ctttactatg 
gggaagagat 
ttgttgtctg 
atgtaagctc 
ctggaacagg 
tgaatgaatg 
accgagtaag 
cagtgcatgg 
aatcccaagt 
agaaatagac 
taatatacac 
aggtaatatt 
gttgcttccc 
caggctttac 
gccggatatt 
gtcttaactt 
gctctaggtt 
ttcaggtggc 
aaatctcatg 
ggtcatgggg 
ataagatcta 
ttgccatgtg 
cctccccaga 
attaaacctc 
gcctaacaca 
gcttgagaaa 
cccctcccca 
cct teat etc 
ttcaacatac 
acaatgataa 
atattctagt 
actgtgatga 
atttgacata 
aaaaaatgag 
gacatgtaca 
ggttggtttg 
aatacggcag 
ctataatacc 
ctaaagcact 
attactcctc 
agcccactgg 
cccactgttg 
gtcaaatagc 
ttgettaate 
ttaaatcaga 
taaataatag 
tactagctag 
atgtataacc 
attgeatttg 



gccctcctcg 
gtcctcattt 
atcaggctta 
tcccatcctc 
tttcttgaaa 
aagatagtca 
tcaaatcctt 
gggggagtgt 
gctgattcaa 
atacacagtc 
aaattatcat 
caaaggaaga 
gecaagaata 
agaattctca 
gagactaaaa 
gattgecagg 
tatatgaatt 
acttgaacaa 
tcagagacct 
tgatggagtc 
acctggaaaa 
tcaaatgtaa 
tcagtacaca 
ctcctcactg 
gaattgetea 
tactcacctg 
ttgaattgta 
gcagatccct 
gttgttataa 
acatgettge 
ageagatget 
ttaaataaat 
tagtgagtgg 
ccacagagtc 
atcccaaata 
cacagctatc 
gctgagttgt 
ccaaaagcag 
ggggcaagca 
aaattaccaa 
atcaggaaca 
aaagcattaa 
aatattcttc 
gatggggaca 
tgttagcctc 
tagtctcccc 
gggtttgtgt 
tactccaaaa 
catcaaagac 
ggaaaccata 
tgatgtaaat 
ctgctgagcc 
taattaaata 
tctattagac 
ttaggcacta 
ctttggtcac 
cttcataaca 



gcctcccaaa 
cttatgctcc 
ttgccccttt 
atttctggct 
gtgagtcacc 
ctatctctta 
aacacctgag 
gctgatttca 
tagttcaagc 
cactcttgtg 
atttatttgt 
ggaaccttat 
tgatagtaac 
caataactcg 
tgtggctaga 
atttgaatca 
taacaaagtc 
aaggggaata 
cggtgagtga 
ttggaggata 
actgacatta 
agaaaagcac 
agaaaactga 
ctgatcttgt 
cccatttctg 
ggatggtagt 
atccccaatg 
cagggcttgg 
aatgtggcac 
tcccacttca 
ggtgctatgc 
tatccagtct 
acagagtact 
atcactgaca 
gctactaggt 
attctggtcc 
tgtctgetet 
acatgactcc 
gttgtaaatc 
ggggagaagg 
ccagggaagt 
ggtatcgaat 
aatggaagga 
gagaacacag 
tgtgtctaga 
ttctggctca 
gtatgtgtat 
aatgtcagtt 
ttctgtaatt 
taacatgatg 
cttgtttctt 
tatggtttta 
aagecttgag 
tatacagect 
tttctattgc 
cagatcaata 
cctaatacaa 



gtgctgggat 
agectcagtg 
ggacctttga 
actcatattt 
ccattctcac 
acaaacctgt 
tctgatgggg 
ageatttgee 
tatcagcatg 
agecactatg 
ttagttgtaa 
ttgttctggt 
ttaataaata 
gttaggtagg 
gattaaataa 
aggcaatttt 
actcctcttg 
atgcacccat 
gaaccaggag 
tcattggagt 
tgaaatcttt 
agaagcatta 
tagtccactt 
gacatgttct 
cttctgattc 
gtgatatagt 
ctgaaggtgg 
tgctgttctt 
ctcccccacc 
ctacccacca 
ttcctgtact 
caggtatttc 
attgeagtet 
actgecttgg 
tcctttgtca 
aagatatggg 
gtcccaggca 
tacactcaca 
aaataaccac 
acatggtgct 
agtcacattt 
ggaggaaagg 
ccttagctcc 
gcagagaatg 
tatttgaggg 
ttttgtgcac 
gcacgcactt 
gtttttctag 
tagcatcaac 
ttagaacata 
ccattttctg 
tacttttaaa 
cataatccat 
actcactatc 
ttagaacaac 
tcttcagctc 
tgeatgaage 



tacaggtgtg 
gctttcattg 
atcacctgtc 
caggtcttag 
ccattaagag 
tttagtgtca 
tggtggtgag 
aattttcatg 
atgtcactga 
caccagctct 
ttcttttacc 
tatggttgta 
actgtggaat 
tgctactatt 
cttgtccaag 
tctccagagt 
aagagctcaa 
gacaatgaca 
agtggcatgg 
gctttttcca 
cttggataag 
ttgagatggt 
gaggcaegtt 
ctgatgaagg 
tttgcctcct 
ttggatgttt 
ggcctagtgg 
ccaataggga 
accctctctc 
tgattataag 
gectgeagaa 
tttacagcag 
ctgattgttc 
cactccctgc 
ggtcttgeat 
tctattcatc 
atgtggtagg 
gatcctattt 
ctgcataaat 
gggggagtct 
ataccaaaat 
tcattcctgc 
tgecaggata 
atgtgggatc 
aatcaatttt 
cctttgccaa 
cccccaaccc 
tgcctgcaat 
taaactttca 
gtgacagaag 
cctttgtgac 
ttgaaattat 
gatgtactga 
tctcattctt 
ttctttcgea 
cttcctttca 
atgcttttaa 



agccactgtg 
ttactgtttt 
catgetgact 
cacaaaatct 
atagtgacta 
tcctacacag 
gagggggagt 
atgtaaagac 
atgagaagtt 
gacatacttc 
ctcctatgca 
tttctagcat 
gaattatgaa 
atcctcattc 
ctcacacagt 
ctcttctcta 
agtctagaga 
aatatgtacg 
tgattaagta 
ccatgccaaa 
gaagagaaat 
ttgagggtca 
tccagacagg 
gcttttttaa 
gcatcaaaat 
gctcccaccc 
gaggtgtgtg 
ctgacttctc 
actcttgttt 
cttccagagg 
ctgtgagcca 
tgcaagaaca 
tagtagecag 
ctcttcttct 
catacctttc 
cactcattca 
ctcagggaat 
ccaggggggc 
ggataatgee 
ctaacagggc 
cttaggttct 
caattgaaaa 
gtgaaaagca 
aggctgaata 
tgcactccct 
attcctcttt 
ccatgcctcc 
gtaaggecta 
gctgtcactt 
gagctttgga 
ctaggataga 
agtgtaaagt 
aagtaatgat 
atctagtatt 
tctatccaaa 
ggtactatat 
taaacattcc 



97140 

97200 

97260 

97320 

97380 

97440 

97500 

97560 

97620 

97680 

97740 

97800 

97860 

97920 

97980 

98040 

98100 

98160 

98220 

98280 

98340 

98400 

98460 

98520 

98580 

98640 

98700 

98760 

98820 

98880 

98940 

99000 

99060 

99120 

99180 

99240 

99300 

99360 

99420 

99480 

99540 

99600 

99660 

99720 

99780 

99840 

99900 

99960 

100020 

100080 

100140 

100200 

100260 

100320 

100380 

100440 

100500 



93 



CL001205 



ttggatggat gaataaataa atgaataaag aaatgaagta aaagaaagtc aatttttata 100560 
ttattctaag tgagggaaaa aagagaaacg aatcaaaata tcttggaaat aaaattctgt 100620 
tcctctctga gcttttgatt tgtttataag ctggggagca tgttgcttac catttattta 100680 
gtctcacaag tatgttaaca tcatcaacat aaggtttatg aagtacttta tactgcctgg 100740 
aggaaggatg gtatagaaat ttaaaatatt atatatgatc cttccaggaa gaaaaaaaga 100800 
agcaatatct attctgtgag gtgcatcaat tttggatcac tctaatggaa gtgccccgag 1008 60 
cagttggttt atttcttcaa atgtgaatta atttatacat ttcaaagctc ctgatggata 100920 
cttttcattt taattaagta catttttgcc aaatttcaac tttaaaaact caacaaattt 100980 
gttcccacac ttgctttgta gaatttgcaa tattagatat aaatttatta taaaagggta 101040 
tgttagaaac ttctcatact gaaatcggcc acccagcaac tttttcttag gattcaagtg 101100 
ctctaacatg tgcttgcttg tttgtctgtg tgattcggtg gttttatttt gattttcata 101160 
gaaaataata aatatgtctt gaaatgatca tttcattact gagtattgcc agaggttcag 101220 
agtccttgtg tgcatctgca tttactctca ggttggcact ataaactgct actgcaattg 101280 
tgataaacta tcgagaacag aaaaaagaaa atttgataca gaaataatgc atagtaaaat 101340 
aatgctgggt gagggctcac tgacataata atgtactgta tggaaaaaga gggaagatgc 101400 
tgttgaagaa actgaatatt cacgcagcgc acagtagttc aggtgtgctg agctcacaga 1014 60 
atcaatgtga ttgcacagta ctatattgct gtcacaaatg gtgttttgga gtaaacaaat 101520 
accattgttt ctaacattaa attaatattg gtgattcaaa atgtactgaa attataattt 101580 
gtgtttgtct catttgtaaa tttcctttgg ttttatagtt ttaagatagc tataaataaa 101640 
ggattgatct ctgacttcat gtttgtacat tttcaagtat cattatagta aatataattt 101700 
gtcaacattg tggattcaga agaacagttt ccttcaaaaa gcgttcataa attacatcag 101760 
ttcgaaaaaa ttgcattagt agatgatagt aaaagcctcc aagaaagtgt tgcattatgg 101820 
ccaagcttta ttataccaag ttcacaaaac ataaagggag gaatcagaaa tgcagagaat 101880 
ggcagatatg gaaacagcag ctttgcaggt aagagtaaaa tactgaagtt ctaagaggtt 101940 
tttagggttg gataaaatgg aaaaatcaag actgagagac cgccagagtc ctgtgagtat 102000 
tgttgatgaa ctcttaaaat gtgcaaactc aacaagctat tgaaatgtgt gtgtgtgtgt 102060 
gtgtgttaca atatatgtat ctcttcttca atgctttgga gatctttttc ctacaggact 102120 
acttctctaa tttaccaata acaggctttg tggaaatgat accaatttta aagaaattta 102180 
ctttacacct atattttcct aaaaaaaaat ttgtgaaaca agggcatcct tattttacct 102240 
cttcaaaact gttatctata ccaagttatc ataaaagcag taaacctgca tttgttagtt 102300 
tttaaacttt attttcaact tcctatgtct ataaatgttt gttcttgttt aggatgtgta 102360 
ctgtgcttgt tagaagaata ccaccttttt ttcttaccct tttaaagttg agaagattat 102420 
ttgtaagagt gtgaaatggt ctaagcattg cccctttaaa tgggggtatt gtgttaattg 102480 
taagcactgc aaagtgggtt gctatattgt ggctgttgta ctcagtgtca aaagatttag 102540 
ttccttcttg acccagtcct agttattcaa gagtcatcaa acagagatac acaattttaa 102600 
attgtttttc agaatgaatc tgaagagagc gaagagttga gtggagaagt cagctagatc 102660 
atccttgtct atctatgcag actccttccc ataatttttc cccaatctag tttatgccta 102720 
attttatacc aggaatttct tcctgacctt ttaattgcct gtccttaggg catgaaaatt 102780 
atgagtgtaa ttttacagac cattcttaac ttttcaaaac cattccaacg atattcatct 102840 
aagaaatggc cagtgtttgt ggagcactaa tttgtcacgc agcattgtgc tagacattca 102900 
agatatccca ttgagtatcg cacgaaaacc ctgaccaaca catccctata ccaactaggt 102960 
cagatcctca ttcacaggta ttcataatac ataaaattcc cctgcatagc actaggtcac 103020 
atgtaggcaa taattattta tcttgtatat gccttttcac tcaactgtga gctcctagag 103080 
gacataggta aaatctgttt tgttcactgc tgaattccta gaacccaaca tagtatctag 103140 
caccaagaag cactcaatag aagttggatg aactaaagaa gaaatggttg gtctaggaag 103200 
gggttgggac cataagaagc atactgttat ttaagaagca aggaaggcat ttaaaaagca 103260 
cagaattgaa taaaggcaga ttcctgcagc aagaatgcca cacaagtcag aaagtaggcc 103320 
agattattaa ctacaaacta aggaagtaga ataacctttt gtgattgcaa catgaaagca 103380 
aaagtccaaa ccaagaatca tctcaaaaga tagaaagtat attaggaaaa catgctttaa 103440 
ctgtacccgg aagaaaaaga agtgggctac cccttttagt gtgtgaggag ggaaagcaaa 103500 
tgactgattt caaaaggcaa aaacatctcg ggctttattc ttttgcccca ccttagtttt 103560 
tctttcccag agtaaaggca gtgatattat aaaagcacac agcatccaga ggggttgggg 103620 
aggagtatgg ccatgaaact aagggctgat gaaagatgat tttagaagtt ggttcttttc 103680 
atagcaagag gctgtattcc ttgaactgtc caagccatgg cctgttgttt ttcagaactc 103740 
acagaaggtg gtaaaggtca gagagtgtcc ctgaaactag tagatggctc tagattccat 103800 
tgaattcctg ccaaggggcc tgacagcata gatgaattct actacttcca gcttcctcta 103860 
gagttagcag agtccaggtt tgggagtcag gagacccaga ttctagtcct gtctgtgccg 103920 
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tggtctcttt gtggttttga gcatatcact ttatctcttg tatcctcagt ttcctcttgt 103980 
gtaaaaagga gatttttctc atcactagtt ggtaggacaa ggacattttt aatttatttt 104040 
ttctccctaa ataatgcatg cataacttac aaaatcaaat gttacttcaa gtcataagac 104100 
aaaattcaac attctcttcc ctcctccctt tctatcttgg attccagctc tcatagggga 104160 
tcactttagg ctctcttagc tgtttcttct gatattttac ctttgtatct cttgtattgc 104220 
ctgttctggt ttgggctgct ataacaaata ctatagactg ggtggcttaa acaaaagatg 104280 
tttatttctc agttctggag gctgggaagt acaagatcaa tgtgaggcca atttgggtct 104340 
tggcaaagac ctgcttccta gtttacagat ggccaccttc ttgctgcatc tttgcatgaa 104400 
agagagagga gagacagaga gaaattgagt ctctttctct tcttagagtc tcatcatggg 1044 60 
gctccaccct cgtgacctct tctaaatctg attaactgcc aggcgcagtg gctcacacct 104520 
gtgatcccag cactttggga ggccgaggtg ggtggattgc tcaaggtcag gagttggaga 104580 
ccaacctggc caagatggca aaaccctgtc tttactaaaa atacaaaaaa ttagcctggc 104640 
ttggtaacac atgcccgtag tcgcagccac tcgggaggct gaggcatgag aatcacttga 104700 
gcctggaagg tggaggatgc agtaagctga catcacgcca ctgccctccc acctgggtga 104760 
cagagtaaaa ttccatctca aaaaaataaa aaataataat aaatctgatt accccccaaa 104820 
gactccatct tctaatccta tcccactaga gattagggtt tcaacttatg aatttgtgta 104880 
gggggcacaa acatggagtc catagcaatg ccgtttcctg atttttcaat tttagatatt 104 940 
agtttttgac tgtactatgg cagatgcaga atattaaatt aatattcagt atttacgtta 105000 
tgatgactaa ataaatacct tcacaaccaa gccaaatagc acactcttaa actttttgtg 105060 
tttctgcttt attgtttttt gtttgtttgc ttgcttgctt gctttgttta ttttattctg 105120 
catattttat cattgactca gatccaaaat ttctgagcaa accacagaat tcctcccagt 105180 
tacaatcagg agtgtgagat gatctctcag ttgcatggtt ttcccagaga ccttcctctg 105240 
ggagccccat tcagctgaag cctgacctgg ttgctgctgt gacctgcagg agagctatct 105300 
tcctgggacc aacacttttc gctagttcag gctctcctat tctctatatc tcattcttcc 105360 
tcattaccag ttatgctctc attttggcag aatgcatttt cccgtagttt cttaagaaag 105420 
acaacatgag ggtaaatttt tagatcttgc atatttgaaa tagtttttat tccactctta 105480 
aacttcattg attggagatg aaatgcaagc ttggaaataa tttttattca gaattttgca 105540 
cacattattt cattatcttc ttttttccat tgtggctgat taaaaagtcc aatgttattc 105600 
tgcatgctcc ttctttgttt gggatttatt gtcccttctc ctcctccaga agctgttacg 105660 
attatctttt tgtctctaaa gttctaactt caagacgatc gcctttttta attcacaggt 105720 
cattcatggg gctagaaaga cattctgtaa gctttttcca cctgaagact cagacccttc 105780 
agttttaaga gattttcttt tgtaatttat ttgatatcat ccttgccttc atttctgctc 105840 
tttcatttgc agttccatat gttaggcttc caagattcta tttctttgac ttttatttta 105900 
ttttttgggt gataccttca acttcgtctt tcagtatttt tgtcaactaa tatatttatc 105960 
tactgcatta acttcctaga gttctttttt gctttttgat ttttccttca ttatagcatt 106020 
ctattcatac tctagaatga gtgtatgttt ttttaatgtg ttcttttggt ccctaaattg 106080 
tgcttgtttc ccatgattta ttttcattca tttgcttgtt aattttaatt tctccctctc 106140 
cctcctccat atttagaggc ccttgactat ctgctcttat aagtaccaca aagccaatgg 106200 
gctcttctgc atgcaagtag gaagatggcc agtaagtgtc ctctccctta ttctcagcta 106260 
tacctggtga tcctaaccta gagtctaaat actttgtctt cttcagagtc cacccccagt 106320 
cttctgcttg gctgaaaaag agggattacc tggctgcata ggctagggca ggggctctgg 106380 
ggctttccac caggttttac cccatcccat acctcagact tcaaagtatc cagtgcttca 106440 
tatttttaca cctttcttgt gatctgtggt tttatggctt tcttcttatt gactctactc 106500 
acttcctctt cacagatggt tatttcagct ttcttcacct tgctaagtca gttaccttca 106560 
tccactctcc atccttcaga tgttgtactt cctttgtctt ctcttccctc ttttaagttt 106620 
ctctttgatc tgtatattca cacctatttt attattttgc tgtgatttat gtgagtttgg 106680 
gggagatagc aaagataagc atgtatgttc cacatcacat gcatcagaca cagtgttgtg 106740 
aaactgtaat ttttaaaata aacttttatt gtagttttag atttacagaa agttcacaag 106800 
gttagttcag agagttccca aatactctgt gctccatttt tctcccctat gattaaaatc 106860 
ttacattagt gtggtgcatt tgtcacagtt aatgaaccaa tactaatact aatacattat 106920 
tattaactaa cactaaatac ttgtttagct agctctttgg aaagaagtca ctatgtccag 106980 
tctacattta agaagtgatg agttacactc tacctctttg agggcagagt gtctatgtaa 107040 
attatttcaa attattctga ctgggaaatt tgtttcttct cactatttat ttacatatcc 107100 
agtcatttat ttatatcaat atggattcaa ggatatctat tttatacttt gggttataat 107160 
tcaataccat ttcatttatt ttattgctca cattgtgaaa tgctttttaa ttgtaaacct 107220 
ttatctaaaa agcaagatat gagttaaata atataatata atatatatat aattaatata 107280 
tataatataa tgtaaatacg gtctacatct tagaaatagt tctttagtcc tttactaact 107340 
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aacaaagtgc 
tttggctaac 
ttccatgaga 
aaatacacat 
ttatgtcatt 
gcatccatca 
gagagtcttt 
aaaattatca 
ccagccactt 
atccaacaaa 
acactggctc 
atagagctcc 
gtacaaatgg 
ttcatcaact 
agatggagac 
ccttctcaaa 
tccttccttt 
gtttaatgaa 
gccactcagc 
aatggtctga 
gcagcctcct 
tttaaggtcc 
tgcagagcct 
aactcacttt 
gaatatgaat 
tcctgcaccc 
aggggatcac 
attctggttg 
cctcactttt 
tcagtgagat 
ttttacctat 
ctaaaaaacc 
ctgtaattcc 
gacaaggctg 
ctggtgctgt 
cttgagccta 
ggtgacaagg 
agaatgacat 
atgtataatg 
tatattatgg 
aaactacttg 
cctttgattt 
tgggttgagt 
aaattaaaag 
gtaagaaaaa 
tgaatgccaa 
ctgagtttga 
tctgtatttc 
ggctcttgag 
ttaacagaat 
aggtttaata 
ttttatttga 
gctcaccgca 
gctggaactg 
tggggtttca 
tcagcctgcc 
attttatata 



tagacacaga 
ttagtgcttt 
ggtcacatag 
atatagacaa 
ttcatagttt 
tggctggcat 
taaccacaca 
aatcagaagc 
tatttttgtt 
caattccatt 
actgatcccc 
attttgcaga 
gagaggtgga 
tggagcccct 
tctaaaaact 
tgaaatgtaa 
tccaggaaga 
aaatcatatt 
tgtcatacag 
gctcacccgg 
ctgctaggaa 
tttgaagggt 
taagccctgt 
ccctcttttc 
tggttattga 
cattttgcaa 
caaatctttt 
cttcatttca 
ctcatctaaa 
aatgcaatgg 
tattgaataa 
atacttgccc 
agcactttgg 
ggcaacatgg 
ggcatgtacc 
ggaggtcaag 
tgaaactcag 
gtaaataaca 
attatatata 
caattatata 
ggggagggag 
gattctgatg 
ttagtaccta 
tatattttag 
tataatagat 
gcagcagagg 
caatcttact 
cttcttagga 
tttatgatat 
aaaatgaagg 
aagcttccgt 
gacagaatct 
agctccacct 
caggcgcccg 
ccctgttagc 
aaagtgctgg 
atttccttcc 



atgctgggca 
taaatatata 
atgcataaaa 
taaaaaaaaa 
aagagtgaca 
cccagaaaag 
gggtataaca 
atcttaaaat 
catatttggt 
gtttatacat 
acagcaggcc 
tggggaaatg 
tctagaactc 
gttctgacgc 
cagtgttgtg 
tatctcagcc 
ggggattctg 
tgtttgctaa 
gctgatgggt 
tcccgtggcc 
tgaagcagag 
tcactctgcg 
tttgaagggg 
ctttgtatac 
gactttcaga 
ttttgcatgt 
tcaagaactc 
catctatgtg 
aagaagcaat 
tccctagtac 
actttgctat 
tcaagaaaat 
gagataaatg 
caagaccccg 
tgtagtccca 
gctgcagtaa 
tctcaaaaag 
gattacatat 
gtggatattt 
tataatcaaa 
aatgcaagaa 
gtttcagaaa 
ttagaggaaa 
gaacaccaga 
aaggtggaaa 
tgagggaagg 
tattagcctc 
atttcttgtt 
ggtccactga 
ttaaaccacg 
ggagtggggc 
ctctgtgtca 
cccaggttca 
ccaccacgcc 
caggatggtc 
gattacaggc 
tttttggctg 



ggcacatagg 
ttcagtcatt 
gctccctcaa 
aacaggtcaa 
gatttcatgg 
gctgggaatg 
agtatgcatc 
tcacttttct 
tttcggctct 
tgtggcttcc 
tgtgaaagag 
gaggcccctg 
agcactcctg 
tggaaaagct 
gtttgttagg 
ttagagatta 
caccactaat 
agctggtccc 
caggcaagag 
tccccaggca 
agagcaagca 
ggagactgac 
cgctttggtc 
ttcaactctt 
ggcaggatct 
tattcttctt 
aggttctata 
gcctcagacc 
gagttcttgc 
atagaagcac 
ttctgagtgc 
cacagtctag 
caggagaatt 
tctctaaaaa 
tctactcagg 
gccatgttca 
aagaagatca 
ataatgacca 
tatataataa 
tatactaagt 
taggaagagc 
ggaagcacct 
gggaaaaata 
caatccattt 
agcagtttat 
actcagatcc 
ccttaactgc 
tcttcctttc 
aaatctacaa 
gcagcttttc 
tattttgttt 
cccaggctgg 
tgccattctc 
tggctagttt 
tcgatctcct 
atgagccact 
attgaagtcc 



attggaacac 
tctaaattcc 
ggactgtaac 
cactgtcact 
cctgagtgat 
atttagacag 
tattcttttt 
ttgaaaaatg 
gtccacatgt 
agctgacaaa 
gcagttacaa 
attttcagga 
actccaaatc 
gggtggagga 
tctctggtgt 
aaatgggttg 
ctttgctagt 
accggcaagc 
gtggacgtag 
ttctgcacac 
aacaccacca 
ggttttgaac 
aatagaaatt 
agtacgttca 
ctgaaggtct 
tctaggtttg 
ggcagactgc 
agttatttaa 
ttcagaggat 
tcttaaaata 
caaaggaaat 
ctgggtccag 
acttacacct 
aaaaaaaaaa 
aggctgagtt 
tgccactaca 
cagtttagaa 
ttgtataaat 
tatactatca 
cttgtcaaag 
acatcctgtc 
gactgggttc 
aaagcaaaga 
gcagttacac 
gcttagaatc 
taaggtggtc 
cttccttaac 
cttatagcca 
tctatctgtc 
tcactaacat 
aggtttttta 
agtacagtgg 
ctgcctcagc 
ttttgtattt 
gacctcatga 
gtacccggcc 
attgctaatt 



taaatacttc 
cagtgtcatg 
cttattaggg 
aagtagcaaa 
caatttggat 
agtgaaatga 
ggaatgttta 
tatgcaagat 
acatttcaaa 
acccctttat 
caggtattac 
ggttgcacag 
caaggctctg 
gaggcaggag 
ccttactctc 
ccagttattc 
tgaacaagtt 
cggtgctagt 
ggtctctggg 
ttggctgtct 
ggaaagcttc 
atttcagctc 
tggtccttag 
gggactacct 
gtgccatgga 
tggcccaatt 
cagggtttgt 
agcctaggag 
gattgagtat 
ttaacattag 
accaagatgg 
tggcttatgc 
aggagttcaa 
attaattagc 
gggaggattg 
ttccagcctg 
gcagatctag 
gtgattttat 
catgttatta 
tagtatactc 
agtgctgtta 
agttaaatta 
gactagcacc 
aggggaaaaa 
tggaaagcct 
tcgtggagaa 
tgcctttgga 
atatttattg 
caatgatact 
tcaataattt 
ttgtatttta 
cacaatctcg 
ctcccgagca 
ttagtagaga 
tccacccacc 
agggctgttg 
ctagtttgac 



107400 
107460 
107520 
107580 
107640 
107700 
107760 
107820 
107880 
107940 
108000 
108060 
108120 
108180 
108240 
108300 
108360 
108420 
108480 
108540 
108600 
108660 
108720 
108780 
108840 
108900 
108960 
109020 
109080 
109140 
109200 
109260 
109320 
109380 
109440 
109500 
109560 
109620 
109680 
109740 
109800 
109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
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actttttatt 
taagcacttt 
atcttgctga 
caggactcct 
gtgtttgcac 
ataataccta 
gttatgttgt 
gaacccacag 
cagtgagttt 
ttctgggtgg 
tttgaagtaa 
gtcctgaggt 
tgtgtgcagg 
tgccaaatag 
tagaagtatg 
aggccgaggc 
aacactgtct 
tcccagatac 
agtgagccga 
aaaaaaaaaa 
aaaaagagaa 
caggtttttg 
agcacctaca 
gggagaccaa 
actgggagaa 
tcagagtttc 
ttgccaaccc 
ttcagctgtc 
cacttcttta 
tggattcctc 
tgcagggaca 
gctaacaccg 
gcttcctatg 
aaaacctcaa 
taaaccaagt 
aaaagtgcac 
gctgttttat 
gacatgaagg 
agtgcccatg 
caacaccaga 
tggccagaga 
caccttgttc 
aagggacctt 
gttcacaact 
gagatgcctc 
catcctcact 
acttattttg 
catgcacact 
aatgtccaac 
atgcagcctt 
tcatcattct 
taggtgggaa 
gcctgttgtg 
aatgacgagt 
tgcacattgt 
taaaataaga 
aacactccca 



acctcaacta 
agaaagtatg 
tggcacttag 
acagatacca 
ataacctagg 
atacaatgta 
taattttatt 
atacagaggg 
gtcaccagtt 
tcaggaaggg 
atgaagggag 
acatggtgga 
gagatgaagc 
cgccggtcgt 
gagtctggac 
gggtggatcg 
ctactaaaaa 
tcgggaggct 
gatcgcgccg 
aaaaaaaaaa 
atgccctcaa 
actgtggaaa 
tacatagaaa 
agaggagaag 
gaagcagaga 
cttatgtctc 
aaaagacctg 
agagcagcat 
tctcatgagg 
ctgaggtaga 
ttgtcaagaa 
actgagcaca 
tctccatgag 
agagtttgga 
ctgtctgatt 
taaagcccac 
aatctcactg 
tcttgggatt 
tctaaaagca 
ttgttagggc 
acaaattcta 
acaggatgat 
agagagtcct 
tattctgcaa 
cagggctgct 
tctttagctg 
aaaaggacac 
tatgtttatt 
aatgatagac 
aaaaaatgat 
cagtaaacta 
ttgaacaatg 
gggtgggggg 
taatgggtgc 
gcacatgtac 
aaaagacact 
tttgatagat 



gattattttg 
aaagaagttt 
acagtacagc 
acatctaggt 
cacatcctct 
aatactatgt 
tttaatattt 
gcaactgtac 
gagaagactt 
ccaagatcat 
gggaaaagaa 
gtcacagggg 
cagctagata 
cagagagtct 
ccggcgcggt 
cgaggtcaga 
tacaaaaaaa 
gaggcagcag 
ctgcactcca 
aaaagtatgg 
atccagaatt 
gtcaaaggag 
atggtgatct 
tgcctgtctg 
cacgtctacc 
acagcctggg 
ggaggtgaag 
cacatcttgg 
aaggtggaag 
aacttcctct 
aattcattca 
tgctggtgcc 
ataagagatg 
atcttaccta 
ctgtgatcca 
atacttcaca 
ctaaagatga 
aaggttcata 
aaagtgaaaa 
tgagctgaaa 
ggagcctgaa 
ttcaggcagg 
aatgcagatt 
atattcattg 
agaaaggaga 
tgccctccat 
tatataccaa 
gcagcactat 
tggattaaga 
gagttcatgt 
tcgcaaggac 
agaacacatg 
agtggggagg 
agcacaccag 
cctataactt 
atatagcttg 
gatgaagcca 



tccagaaagt 
aattcacaat 
aatcccttga 
atgtttaagt 
cacatacttt 
aaatagctgc 
tcaatctgca 
tcaaattatt 
aaacgactaa 
agatagttaa 
aaatgtgcct 
aaaggaagaa 
aggtggacac 
acagtgagag 
ggctcacgct 
agattgagac 
ctagctgggc 
aatggcgtga 
gcctgggcga 
agtttgtaag 
tatttaggaa 
gcacctgaaa 
caagcctcag 
tttcttgcca 
accatggaac 
aacttgaatt 
ccttctgttt 
gatcccacct 
tctgagctga 
cactggaagt 
ttcgctggat 
agaccctgtg 
ctgttatccc 
agatctcata 
tattcttact 
aagagacttt 
aggcattttt 
tactctcaca 
ttaaatcaga 
agctcctaag 
atgtgagatg 
ttccttagat 
gttatcaatt 
tgaacttgct 
aggaggaaca 
tgtctgtgtc 
aggactataa 
tcacaatagc 
aaatgtggca 
cctttgtaga 
aaaaaaccaa 
gacacaggaa 
gataagcatt 
catggcacat 
aaagtataat 
aattgctctt 
aaaaagataa 



ttactgagca 
acttatccta 
tatccacagg 
cctttatata 
aaatcatctc 
cacactttat 
gttgtgaatc 
tgagagtggt 
ctctataata 
agatccagag 
gagaaatagc 
ggctgtcggc 
ggaggcactg 
ccatctcttc 
tgtaatccca 
catcctggct 
gtggtggcgg 
acccaggagg 
cagagcgaga 
gataggtttt 
agaaatagcg 
tttcagacca 
agtgaaaaat 
ggctgggggg 
cttgctgcta 
aaatgagctt 
atttcaagtt 
ggagcatcct 
tcagatgctt 
tttcaagtag 
gaacagaatg 
tttagcactt 
cattttacaa 
actcataagt 
ttttaacttc 
ttttaatatc 
ggtttctgac 
tgtgtgagtt 
gcggaaaaca 
taaactagtt 
ggagagccag 
gactgaagtt 
atgctgtaca 
atgtgcctag 
ataggacaaa 
atgtcagcat 
atcatgctgc 
aaagacttgg 
catatacacc 
gacatggatg 
acactgcatg 
ggggaacatc 
aggagatata 
gtatacatat 
aataataaaa 
agacagcaat 
agtaatttgc 



tcttctaggc 
aaagatcaca 
ggtttggtac 
acatggcgtg 
tatattactt 
tattttgatt 
tgcaaacgta 
gcaagcccgc 
tagagattta 
aaaatgggac 
ataagtaaaa 
ttgactaaag 
cgcaccttgt 
ctgtttgtaa 
gcactttcgg 
aacatggtga 
gcgcctgtag 
cggagtttgc 
ctccatctca 
caatcagtag 
cttttccacc 
ctatacccaa 
gaacatccca 
ccaaagggaa 
aaaattgatt 
tgcacagaga 
taactgtagc 
agcctcgggg 
gatgatgaag 
aggaccactt 
atgataaaca 
taagtatttt 
gcaagaagcc 
ggtaaatatt 
ttatttccag 
ttacagccct 
cttttgctgg 
tcatgtcctc 
cctctctatt 
catggccaag 
tttgcaaccc 
tttgaattgt 
gaattataga 
tattggccag 
gttgttctct 
tgcagttaag 
tataaagaca 
aaccaaccca 
atggaatact 
aaattggaaa 
ttctcactca 
acacactggg 
cctaatgcta 
gtaactaacc 
taaaagaaaa 
gttctattct 
ccaaagtcca 



110820 
110880 
110940 
111000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
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gccatcgttg ctagctagtt gccactttac cccagtacct cctacagccc tgggctacag 114240 
ttcccacatc tctacgatgg aggaagggga acagactctc tctagcagtc ttttccaagg 114300 
ccaaagtgtt ttcattttct gaatttcttg aacatccact gtttggccag tattgtcctt 114360 
agctatctaa tagtcacaac ccttaagttc ttttctttct gtccctcgtg ccctttgtct 114420 
tcaggctggt aaactgtcat agatattgat ctgagctttg tttaatcatg ctaagcttat 114480 
accatgggtc ccagggaaac taggaagatg aacagtatat aacaattaca attaacttgt 114540 
catttagcat taggtatatc tcctaatgct atccctcccc tgtcccccca ccccacaaca 114600 
gtccccagag tgtgatgttc cccttcctgt gtccatgtgt tctcattgtt caattcccac 114660 
ctatgagtga gaacatgcag tgtttggttt tttgtctttg tgatagttta ctgagaatga 114720 
tgatttccaa tttcatccat gtccctacaa aggacatgaa ctcatcattt tttaaggctg 114780 
catagtattc catggtgtat atgtgccaca ttttcttaat ccagtctatc attgttggac 114840 
atttgggttg gttccaagtc tttgctattg tgaatagtgc cacaataaac ataacgtgtg 114 900 
catgtgtctt tatagcagca tgatttatag ccctttggct atatacccag taatgggatg 114 960 
gctgggtcaa atggtatttc tagttctaga tccctgagga atcgccacat gtgccctaaa 115020 
acttaaagta taataataat taaaaaaaaa cagttaccag tatttattga gtgcctaagg 115080 
tagtaaaggc ttgagaagct ggagcctatt ttcataccga atatgtaaat atcgacatct 115140 
ctgctctaga agatctagga cactctggaa aagtgtctgt atatcttata ttaggagtag 115200 
tggcctcacc aatgtggtgt catggttagt aaaggacagg ctgagtgaac aagaagggac 115260 
tgatcatgtc tttgccagtt gccaccgcct agctacgttc ttgggcatgg gacctaattt 115320 
gtccagcctc aactttcatc tgtgtgacag gtccaataat tccccatacc acagggatgt 115380 
tgtgatgatt aaatgataag cagagatttt agtacaggat tggatatata gtaagctctc 115440 
cataaatggt aatactatta ctatacatat acatgtattt tacataccta tatatattta 115500 
catacattta tgatctagtt tataatctcc aatatgcctt ggaagcttca gaatgagtct 115560 
tatttcattt ttgtaatgtt actatcatac acagcaataa cagaaacaag ttctaaaatt 115620 
tctccctgga gctttacttc aggagtttgc taccaattag aggggtgggg ggaagaggaa 115680 
gagatgggag aaagagaatt tcacctgttt tagtctctat atacaaataa agtgggtgac 115740 
ccctaacttt gccaagctct tgcctcagaa tgaaaccaca cccagtgcct gtaacttcct 115800 
aatcaaacac gtgagttacc ttcaagggtt acaaaatcat agttcatgga tgaaatatgt 115860 
gtggcccaaa cagaattcta atttgttaaa aaaaacagtt gctaaccttt caaaatacaa 115920 
gaatttcatt tgaaaatcta ggtttctgtc ttctcactta cttttaactt tcttaatctg 115980 
acctacatac tcaaggcaaa actatatagc acactgtttt atcacaaagg taacactcag 116040 
ccaagtcaag aattgtagcg ttgtcagccc ccaggaatgc tcttgtgccc atccctagct 116100 
ctatacactg tctctccaaa cagaacctct atcgggattc atcataacat ccttgttttt 116160 
ctttagaact ttaccaccta agcatgcatt tctaaatgtt agagtttaat tttgtctaca 116220 
ttttggaatt cagaatatac tcctgacctc acggtaggaa aaatcttctt aaacaaaata 116280 
taaaaccatc cataaggaag aaactgataa attgaactgt attaaagtta acagtttctg 116340 
tcctataaag caggagatac cattaagaga gtgaaaatgc aagctacaga aaggaacact 116400 
cacatttaga atacctaaaa tcagtttaaa aagaggagag acaaccaagt agaaaaacag 116460 
gcaaaaggtg gccaggcgca gtggtcagga atttgagacc agcctgacca acacggcaaa 116520 
accccgtctc tactaaatat acaaaaatta gccaggcctg gtggtgcaca cctgtaatcc 116580 
cagctactag ggaggctgag acaggagaat cgcttgagcc caggatgtgg aggttgcagt 116640 
gagccaagat tgcaccactg cactccagcc tgggtgacag agtgagactc tgtctcaaaa 116700 
aaaaaaaaaa gagatgagct attaacactt aaaaaggcac atcaacattt caataagcat 116760 
ccagtatgta tttatgtgtg tatctgttta cgtttgctta atataacact cataggattc 116820 
atcctcatga ttgtagatag gtgtagttta ctcattttca ttcctgtata ggattctatt 116880 
gtatgagtat gccaaaattt attcatccat aattctactc ttgatggaca tttgggccgt 116940 
ttacactttt gaataatgag tctataaata ttcttgtata tgtgtctggg tgcacatgtg 117000 
cacatatttt ttgttggata taaacctggg agtgttatgt gaatgttcag ctttaataga 117060 
tactgccaaa tagtttttca aagtagctgt accaatttac actcccacca tcaatatgtg 117120 
acaattccca ttgctccaaa ttctctcaaa cacttgggat tgtaagtcat tttaatttta 117180 
gccctcctgg tgggggtgta gtgataactc attgtgattt taatttgtat ttctcttgat 117240 
tatttgtgag ttttaacttc atttcatgtg cttattgaaa tgctgatatg cttttttaag 117300 
tgataacagt taataccttt tttaaaaatt gattttaggt attctaaatg tgagccttca 117360 
gtcaaatgtg tatgttgcaa atatcttctt cctttccata gcttgcattt tcactctctt 117420 
aatggtatct cttgctgtgc aggacagaaa ctgttaactt gaatgcagtc caatttatca 117480 
gtttcttcct tatggatggt tttatatttt gtttaggaaa acttttccta ccctgagatc 117540 
aggaatatat tcttctatat taacttttgg tggccttatg tttttccact taaattaatg 117600 
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atccatctgg tactgatttt tatttagggt atagtatagg ggtctagaat caatttttct 117660 
acatagatat ccagttgtca gcatcattta ttgaaaagat caccttttct ccactaaact 117720 
tcagtggcat cttggtcata aatcaagtga ccaagtatgt ctggttctgc ttctggtctt 117780 
tcttttctgt tccactggtt tgtttatttc tcaggtactt agatttaaac tgagcctttg 117840 
tatctggaac aggaagcttt tccccttttt caaaattgtt ttggctatta ttagaccttt 117900 
acatttacat acaaatttta gaaccagcat gtcacattcc aaataaaacc tacagaaatt 117960 
ctaattggga tataatttaa ttacaatcag tttgggagag ttgatatctt tacaatattg 118020 
agtcttcaaa tcccagagca tagtatattg cttcaattat taagatcttc ttttattttt 118080 
ttcagtaata ttttgtagtt ttcagtacag atcttttgtt agatttattt tgtaggtatt 118140 
caatgttttt tgatattata aaaagtgttc ttaaaatatt atgtcataat tgtttggcgc 118200 
tggtaaatgg aaatacaaat gattaatttt atattgattt tatgtctcgt gaccttgcta 118260 
aattcacttt tttctacagt ttttctttct gtgaatacag tcatgtcatt tatgggtaat 118320 
gagagtatta ttttttcctt gccaacttga ccataatatc tccttattca gtttgttagt 118380 
actttaagat gtttttcatc ctgattatga aataataaag tatgtgtaat tttcttttct 118440 
tacaataccc ttgttcaatt tggtattcag gttatgctga cctcaggagg aaaataaggc 118500 
agtgctccct cttcccattc tttaaaagaa tgatataaac atatcacaaa agatatatat 118560 
atatatatat atatatatat aatcacagtt tctttatcca ctcgttgatt gatgggcatt 118620 
tgcattggtt tcacattttt gcaattgtaa attgtgccgc tataacatgc atgtgcaagt 118680 
atctttttca tataatgact tgttttcctc tgggtagatt tataatctca ccttctatac 118740 
tattgacatt ggtggatttt ttgcttggtt gctctcatat tttctctcaa attattgagg 118800 
ggttggtttt gactttgttt atcttgtcta ttgtacattt atttcattaa cttctacact 118860 
ttatttactt cttcttttct atgtttaatt tgcagggctt tttcctgact tcttaaaatg 118920 
gatgcttggt tcattgattt ttttcatatc ttttcttctt ttctaatata tgcatttgta 118980 
aagccttgca tttctctcta aagatagact taaccacatc actattttct tggcttgttt 119040 
cagatttact aaattttaaa tttttttgac tttttactcc tgtattaatt tgaatattag 119100 
gcacccctta actgttcttc tagtgcttac actgatgatt gcaacattca ttcttggcat 119160 
atcaaatgtt attgtaattg ccacttacac ccttttccca gatggtacaa agaccttgaa 119220 
tacttagttt tatttattta actgctaatg tacatgttat tattgttgca taatttttat 119280 
atttttacag tcgacaatat ggtattattg ttgttgctgc tcttttattc agtcagcatt 119340 
atttatattt acccatatat ttataatttt tattaccttt tattctattt gttatctcaa 119400 
agtttccatc tggaataatt ttccttcttt gtgaataata tcattctgta tttccattag 119460 
tgtgggtctg ttgttgacaa ttttttttct ttttctttgc atcaaaatgt ctttatttta 119520 
cattctctat ggaaaaaagt ttttgttgga tatgtagtct acatcacagt tattctctgt 119580 
cagcacttta aagatgtaat ttagttgttt cttgtgaagg gtcagctgtc tgctttattg 119640 
ttgatctttt gacagtaatc tgtctgcctt tgccccactt ccaggtacag agttactgtc 119700 
tttggttttc agcaattttg caatattctg gttagttgcg attctttttt ttaatttaac 119760 
atcatttggg tacattggac ttctcaaatt tgtgtcttga tgtcttttgt caggtttgga 119820 
gagcttttgg caattaattc ttcagacatc atttatgtgt ccttcttttt tcactttatg 119880 
aaatttcatt tacaatgatg ttaaggtgtt tcactgtaac ccacattctt ttccatattt 119940 
tctatcattt tctctttata tgtttcaatc ctgttatatt cttctgatct atcttgatgt 120000 
ttgctaattc tctttttggc tgcacctaat ctgccattgt tcctgtctat tgaattttag 120060 
atgccagtca ttatactgtt caattttgga aaatttttcg tggttctttc acatggtttt 120120 
cagtcctctg ataaaattat caatccttca tttaatctcc tgggagatag taagcatagt 120180 
tgattatgaa atttatgtct gtaatgtctg attactctat tatcttgagc ccctgtgggt 120240 
ctattcctat tttctctgct tttctgttca ttttaattta tgttgtctta tcttcctgta 120300 
taacagtttt tttttactgc gttctggaca ttgaatttgc aaaatttctt tgcagaaata 120360 
atttgaggcc taagctgatg ttatcttttt ccctagtaga tatatgtttg cttgctgcac 120420 
tagtactcca ttatcaggtc aatccaattt cagggattga gatgatttga agctacactg 120480 
caacccttac tagtacctgt ctatttccag ttcagcctta ctcctattgg gcagcccttc 120540 
tgagtcccag cctaaagttg ggtttaacaa gcttccccca ctgcaatcac catattttgg 120600 
tcctggactc caaatttcat ctttctattt ctggcaagct cttaaatgag cttcctctta 120660 
gttgtttagt tgcttactct agaattagta aatatcccca aggtgcaagc agctccaaac 120720 
acaaaggtta cctcccgggc ttcctccatc cttagatccc agccctgcta ttctttgctc 120780 
ttttgttagc tctcctacat cttcaagtag atttgtaaat attttgccca gcttttcttg 120840 
ttgttctcat tgggagtatt cgtccaaatt acatagtctt tcattaacac aagaggaagt 120900 
cctgttcctg atcttgaaaa aaatgataga tatgtccaca ataggctgaa acaaggcagt 120960 
gatttctccc ttcatatgag gaatacattc tcccacaatg tccagctagc ctccttctca 121020 
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ttcaccttag cccagtgact ttggcatttg 
tttttttttt tgagatggag tcttgctctg 
ttggctcact gcaggctcca ccttccgggt 
gtagctggga ttacatggcc caccaccaca 
cgaggtttca ccatgttagc caggatggtc 
tcggcctccc aaagtgctgg gattacaggc 
tcttaaatat tccaataagg tacaatttga 
catgatatgg ttggtaaatg gaggttaact 
aaagccccta tggggttggc atcattctac 
tagtgggccc agaactcctc tatgaggaca 
tttcaatgga aaactggatt tctagcctca 
tggagactaa aaatagggtg acttacaaac 
gtccactttc caaagcaact gccagcataa 
ggatgaaaac aaaccacaga aaatgaggta 
gaaatctaca tgctcataga ctggattatt 
tggaggaaaa ccctacaaag atttccagat 
tgttgttatc gttgttgttg tttaaagtga 
aatttctgag tatagctact ctatgcaaaa 
gtggtcgaga gtgtagtctc cagagtcaag 
ttattagcat tgtaaccttg ggcaagttat 
tataaaatgg aagtaaaagt gctactttat 
gtcatgcata taaagtatct agaacagtgt 
gctatttata attaattttg ttttagtatt 
tcgtgtgtct ctcaaagccc tggatttctt 
tctgaacagg ctgtcttcaa cctggtggat 
tggatatggg gatttccaaa tcttgctgct 
tcctaggatt tgcccacaga gattctgatt 
atattttaaa aagcttgcca ctacgtataa 
gatgacctgg taccaattat taaatatatg 
caaactgatt gttaaaccat tagtagttat 
aactggcaaa tcctacattc tttttttcct 
acaatcttct tagcccctgg gatctggagt 
tttacatctc catgattttc tgttctctaa 
ttcctgtgac tgaaagggct tccagctaga 
cattgctttg ttttgaggca gaggtggaaa 
gatgcataat tgttggctct cacgaagact 
aaattataag atggcatata aaaatccaga 
acaaacaaaa actaagttga aagaagatct 
caaagctacg tgagcactgg caggggaaag 
atgttttccg cctggccttg agcttgcagc 
aaggtgccag actgggtctt tggagatgta 
tatcgtgtga ccttggtcaa gtcattctcc 
ctagggagaa tgcttgcttt gtctgcctct 
tgaccaccag ctgtagaaaa gtaaggaatt 
attggcacct aggtgggaat gaataatccc 
aatagatagg agtgctgaga agggataagg 
aaacagaaat actagtgaat atgctgaatc 
tgagaatttg aagttgatac tattaaccaa 
tggaaatcaa atctgcacat aaattccaga 
tgagcctctg tttcctcatg tatacaaggt 
tgaagattca gtgggataat gcatattaag 
atggcaactg ctgttatcag ctcaaatcag 
ggaagtagtt aggagaatat gatgtattgt 
ttgcataatc cgtaagggac ttcttctact 
attttaccca gcatttgctg caagcttcgt 
cagcaagttc tgagagccag cggcagtatt 
agtgtgtgct tccccttcat aaatccttaa 



ggattcagac tcttttcctt tttttttttt 121080 
tcacccaggc tggagtgcag tggagcaatc 121140 
tcaggtcatt ctcctgcctc agccttccga 121200 
cccagctaat ttttgtattt ttagtagaga 121260 
tcgatctcct gaccttgtga tctgccggcc 121320 
atgagccacc gagcccagcc cagactcttt 121380 
tcatcctagg tttctttcct ctaagtctgg 121440 
attttattat tttgctagag aagaactcct 121500 
tgtctgcatg cctggaagat ctaaaatatt 121560 
gattgccaga ctgctttgca ctgcagagcc 121620 
gttgtgatgt gagaccagga cagtgttgaa 121680 
agatagtgtg tatctcccag ggagtgtgaa 121740 
ggagtgttaa tgccctctgg gatccaagta 121800 
ggatgttcaa ctctcccttt cagtgatcag 121860 
cagaatagac cttgttgaaa gttactggaa 121920 
tccctgggga agacaaaagt atttggtctt 121980 
actcctgtcc tttgctctaa ttcaaggaag 122040 
taatctctgc ctttagggct tcccctaatt 122100 
atgcctagat ttcaatcctt attctactat 122160 
ctaatttctc tatgttgcaa tttccccatt 122220 
ctccaagggt ttttgtgagg attttaaaaa 122280 
ctggcacatg gaaaaactca aataaaaata 122340 
ataattgtat agtaatgctc aacacgttta 122400 
atctcagttc tgcactcttt catttaaagt 1224 60 
ggttcaaaca ggctgtccaa ctttcttagt 122520 
catttcagaa agctttttga aaatatagac 122580 
tagtagattt ggagtgaggc ctaggaagct 122640 
tatgcaacca atttggaatg actgaactat 122700 
gagccaatta ttaaatattc ataaattttg 122760 
tagctatggt aggaatattt acaccatgga 122820 
ggaaagctgg tttactgtcc caccactggc 122880 
atctctgtct gaatctcctt tggttagaac 122940 
ttgaggtttt gatctgccag gcagtagata 123000 
tatctctgca gccataatca agctctccat 123060 
actggtggtc agtggtgatt ctggttcaca 123120 
taaaattaaa gtcggaggca gctttttaaa 123180 
ttctcacctt ctctttcaaa aaacaaacaa 123240 
ggacctgctt ttcagtggac aaagttcagc 123300 
gtgttctctc accggctcat ttgttcattt 123360 
tcctctggga agcagtgtgt tctgttggaa 123420 
ggtttgtaat catgactctg ttttttaatt 123480 
ctaacagagc ctcagtttct cacctgcaaa 123540 
ccaaatgtga gatggggaaa tgtgagtcct 123600 
attgctaaat attgattcct atttcctgcc 123660 
agaacaggtt ctgataggag aaaatatttg 123720 
agagagtgcc tataatgaga aaagtgcaca 123780 
tacagtcttc ttaagggaaa tttgggaaca 123840 
cctcatgtca catataagaa aatatgggct 123900 
aggctatgcc tttgtgcaag ttaacttatc 123960 
ggatattaga atctatcttg caatgcattg 124020 
tgcctgagat attttgtgaa tgtccaataa 124080 
gcatatgcaa attgaactac tggcaaatag 124140 
tagtggatat ttccttatca gaaaagaact 124200 
tcccattcca ctgaagaact tcaatttagt 124260 
aggataccca gccacacata cccatatcag 124320 
agcattataa agtgtttcca gagttggacc 124380 
atcttaggct cactcagctg gaaaaaaaaa 1244 4 0 



100 



CL001205 



aaagaaatct gggtttgaag aacacttatt 
cttttgactg cgcatctgaa tatcaaggca 
aggctggaac caattggtat tcttggagtc 
tttcctcccc ccaataaaaa tttctgttcc 
aatgtaattc tgtttgattt ccactgaacg 
ccacaagcat ttcaattcct caacaccagc 
ttatatccag taatcatgca tttaaccaaa 
atcatttcca gtccattagt cttatttgtt 
gaatatacaa atacttgcat aagtcatgca 
tgggtctgta tctgagttga atattcaaca 
agttttagcc gcttctttcc ttctttctta 
cttcctctcc tttatagtct attaagttgg 
tttaatgaca aaccacaatg gcttttgcac 
tttttttttt ttgtcagcca tcatagaatc 
agaggcattt acataatccg ttcactttag 
ccatgtgcca tcccaggctt tccttggccc 
atcagccctt gcacgccctg gaccctgccc 
tcacactgac ctctccactt caatcttcac 
cttggggcct tcacacctgc ttctctcgct 
acaccttctt cactgctcca cctcctattt 
cttagatcac acaccatacc accctctcct 
ttacgatttg taattttgta tttacttggg 
tagaagacaa gccctataag gacaaagacc 
agtgccatag acagtgcctt gtgtttgata 
gtatctaaca tctgttaaaa acttgctagt 
gtatttactc atttaatact tacaacaatt 
cgttttgcaa ataaggaaac tggtcaagtc 
gcagagtttg tttgtaaccc aggcattgac 
ctactcctgt taagcacttc attaatattc 
atcaattagt agcaaaactg aaaacctaaa 
atgcaaacag agagcctctc aggtaccacc 
caaattcatt tattgctgat aagaactatg 
gaagtggaaa ttattatttt agattcaggc 
agtcaattga agaggcctta agataaggaa 
acagttatta tcattaacaa tttaggcaat 
tgattttaaa gtatgagtaa tcatcagttt 
aaaactccaa taatgagcct tgttcagttg 
atataagcac tcaacagaca tttatttagt 
atcagcaaat acttactaag tactcgttat 
cacaaaagta aaatgtcaga tttcttttca 
aataaccacc acaatgaata aaagctgcta 
caacacattg ggagtctgag atgaaaggac 
agtcaacata gtaaggccct gcctctacaa 
gcctgtgcct gtaattccag cttctcagga 
gcggtcgagg ctgcagtgag ctatgattgc 
aagatcctgc ctattaaaaa ttgttgataa 
ctggggagtt gctttgtcca ggggctctgc 
tatactcgta acaactttaa gatgatggta 
aactgaggtt taaagatgct aatttctgtc 
gccaggattt gaactcaggt cggtttgagt 
cctttatttt gaaagcacta tctgtgggtg 
atcagaggat gggagtggct acagagggat 
ggtctgcata tcctgcttac ctgggtatga 
gtcattcagg tcccagattc agtatcactt 
actccatccc tcacctgcca caatcctatc 
cctggaaggc ttttcaaaca tggtacaatt 
aatccggtgt gtttcctgat tataattctt 



tatttttgat tttctctgcc ttgtgtgctt 124500 
agaagtgctc aagttcctct gggctctgga 124560 
ggtccttttt ggactaaagc tcctttttct 124 620 
agggagtact ttctcagtca ctcctttcct 124680 
gctcctgaaa ggagacactg aaccatgcct 124740 
tttcaggcag gagcagttcc agatgcattt 124800 
atgattgatg cagcattttt tcccctctct 124860 
ttccatggtt tgggttcctg ccaaatcagt 124920 
catctacaca cacatacatg taataggtac 124980 
ttttatttcc aggcttcatg tctccactga 125040 
cttccctctc tcccttcttt ttttccctct 125100 
tgcaaaagta attgtgattt tgccattact 125160 
caacctaaca cttttcccag aaacagccac 125220 
tgatattctt cagcgttaaa ggggcttgag 125280 
tttcagaaaa tcaacaggcc agaaaggaaa 125340 
ttaggcaaac attccagtgc cacgtccccc 125400 
gctgcatcag ccacctctcc acctcagtct 125460 
gctgtagcca cacaggcctt tagtccttgc 125520 
gctcggacag tttttcatac ataggtgcac 125580 
attctttagg tctcagctca aatgacactt 125640 
gactttccac cctttctctt ctgacactta 125700 
tatttagttg tttaatgtat cttcacttat 125760 
atgttttccc cgcccctcaa ttgtttcctc 125820 
gtgacaatag ttagcaatca taaagtgaca 125880 
tccaggcaca gtgtcctatg atttttatat 125940 
atcccgtaaa gtaggtgtga ttattatccc 126000 
actcacccaa gattatatgg ctggaaagtg 126060 
tccaaaacct gaacccacaa ccaggccatg 126120 
attgagagaa taagtgaaaa atatacctta 126180 
ttgtttgatt gataagccat ttctctttct 126240 
agaatgcagc agaaagaaag agaagattaa 126300 
agagggacgc tgtgatatag atcatgaagt 126360 
aggacccagt ggcaggtgga aaaaatacag 126420 
tcagattctc cagaaacatc aagtgggttt 126480 
tcttatgcta tggataacat tttccaaatg 126540 
tatgaaactg ctgctcagaa ttctaggata 126600 
ttccatggca tgtggaacag ttcctagcac 126660 
aattaatggg aaatgaacat caacaaatcc 126720 
taagtctata cagatcaagg cacaacaagg 126780 
tggagagctc ataatccagg tagacccaga 126840 
gctgggcatg gtggcacatg cctgtaaccc 126900 
cgcttgagac caggagtatg agaccatact 126960 
aaaattttta aaaatcagct gagcatggtg 127020 
gactgaggca ggaggatcac ttgagcccag 127080 
acctctacat accagcttgg gcaacaaagc 127140 
aataaaaaat aaaagtaaac gccaccatca 127200 
catgtccctc tgcatatatt aggtcatctt 127260 
cccttcctca tccccatctc tagatgttga 127320 
ctaaagacac agaaacaaga acttagtgca 127380 
ctggaaccaa tgatgccaaa tgtcttcatc 127440 
aggcatacta gaactacaag actttgcagt 127500 
caatgaaagc aacattaccc agtctctcca 127560 
aacactccat ctctcacctg ccacagtcct 127620 
ctgcatatgc tgcttacttg ggtatgaaat 127680 
gttcaagtcc cagattcagt gtcacttctt 127740 
tgattgtttc ttcctttatg ctccttttag 127800 
atctcgcatt ctgattactt gtttcccatt 127860 
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aactagtctt tgagttcctc taggacaggg gtgaccaatc ttttggcttc cctgagccac 127920 
tttggaagaa gaattgtctt gggccacaca taaaatacac caacatcaat gatgagctaa 127980 
aaaaaaaaat tcgcaaaaaa aaaatctcat aatgttttaa gaaactttac acatttgtgt 128040 
tgggacatat tcaaagccat cctgggctgc atgtggctca caggccacag gttggacaag 128100 
cttgctctag gatacaattt tattctccca gggcctagca cagcccctag catatagtaa 128160 
gtcatttaat aaataagtgt ggaacgaatg aattttaggt agaaattgtc atcagatggt 128220 
tcaactaata tacagcacaa ttccaagaga ggagtgtatt cttgggcatc agggaaaagg 128280 
gactttggaa aagttgagtt ggaggccatg tggctattaa gatatctgtc tccacatgcc 128340 
tcgactgact gtgtgagcct tggaaagtta ttctctcttc acttcagtgt tttcatcctt 128400 
ataattaaga tgataaaaac tgggttgttg ggaagttgaa ataacttaac atgcaggaac 128460 
tcagcagagt gtctgttcat tgtaagtact caaatgttgg ccatggctat agctgctgct 128520 
gcttgtcttc tctgaattat tattatggaa atttgggtga gtacagcatt tcaggcagca 128580 
ggtcacttct gagtaaaagt acagaactat gaagagatct cctacccttg gatataaaaa 128640 
caattcacag cattgccaag aaagactgta gatattcagg gatgagagtg agattcctat 128700 
gagccagaga atcaaaagct ctgtaaatta gtgagttcgt gatatcatgc tacgctcctg 128760 
tttttaagag ctagatatca aaataaatgg tccagtgtat gtgcctgatg gctatgtctt 128820 
ttccactgta cacacaagta agattctgca tggtgactat cattttgttt tcaggaagta 128880 
gatgaatcag ccgtttggtt ccctgggaat gtgggctgct ggcttttaat tctttttggg 128940 
gaagatttct atgtcatcag ccatcttgtt tatctggaaa catgggacaa acgttggctc 129000 
tctttctacc tatgtagtcc tttgctaggg ttgtacagct tgagtagaat tgggaaaagt 129060 
gatttggatc cttgtgactg ggattaggga gaattggtac atgagtatac agaatctttt 129120 
cattgatctc cttgcctctg accttgattt agccacctac tttcagtgga taaggcccta 129180 
cagaacaaac caactgaagc atactttagg ataggccaca cctctaatct catatttatt 129240 
atgaccaact aaaaactgct tatccttctc tttatgtcta ttatttaact ttcccctcct 129300 
ctctaccttc tttcccttaa gtccttctga tacactacat gctcctgcca ttgtgactat 129360 
gagtctaaaa agctggtggc attacccacc ttcccctcca ccttcaaaac tcacatgtac 129420 
accatgcttt gggcttctct gatttggctc actatttctt caacttagaa cgcccacttt 129480 
ctattatgat cattaaaaga tcctggattt atgtgtgggg tggatgtagt aacaactggg 129540 
aaatagcata aaattatttg gtttcttgta tgctggtttt cttcccagag gtaacctgtt 129600 
tggtttgacg tttggcttaa tgttgttaat ttgacgaagg tggaaatgga tgggctagaa 129660 
atctttgtgg gactaggata aggcttgaat ggtctaatct agggattcaa actttctagt 129720 
taagaatgta caacctgtga ggataatttg cattttattc tacatatatc ccaatttaac 129780 
cagctttcag actttaaatc accataagat tttggaaaac ttctactgtc taaatacaga 129840 
ataaagtttt ctaagccaga tgaaaaaaat tgcatcattc ttgcaccctg caggcaaatg 129900 
catgttgact gcaaagctat ataaatgtgt gttgagtgat ttaacaggtc tggctattga 129960 
acttgttaaa aatggatgtc aggaaagaaa aacagcaggt gggtgtagac tggaggagtg 130020 
ggcaatgcct gctgacagaa agaggagtga cagctgacaa ggagggaagg cccaatggag 130080 
acttcaggct caccacgtat ggagcagtct ggggaccgaa acaaagaaat gaaaacaaac 130140 
agagcaccct accacctcaa gaatccctaa gggagttata atatgcacaa atgctcatga 130200 
ttattaacta ggaaccagtt attctatgcc tgctatgagc caggattcat tccaggaaat 130260 
taaagagaat agaaagaagt ataaaataca gtgcctaggt ctgcaagtat ttattacctg 130320 
tgacactact atgttggcca aggagaagtt ctgtatatta tgctcttttt ataaaagcag 130380 
aagcatgtat agaaactatt aggatatttt tcttcaaaat gtttaaaatt ctttagctgc 130440 
ttacaagaga aatttattta gtactacttt ctcaaagatc ccatttcctg acaatgctaa 130500 
taatggaggc atttcaagat aatgacctca taaaattcac acaactgtct gtctgtgcag 130560 
acttagatag cacagtcatg acctcactta ggtcagggac agatcttagc ttgatagggc 130620 
aatggtcaga aatgcagttt ggtttaaaca gttttttcct cttctctgtt gactcattta 130680 
atgaaagaaa gtgtctcatt caacctgcat atttttttca aagattccac aatgatgggg 130740 
ccatgtctcc tatgagttat tatacaaaat ggacactgag tcctgaggta cattaggaag 130800 
ccccaaactt atccttgtat ttaattcatt tgaacatttt ggatatttta ttttatcaag 130860 
ccagtcattt cgagttgaga agtccgaaga tcttactcat tttagaaaga ctaggttagg 130920 
gcattttctc tggggaaaga cctcttctgt aagggaactt caagaaaggg agcagatggg 130980 
ctctgtgtct actgagagac aggaggataa tcaggcagtg acatagccca tggtactgga 131040 
gagaaagaca aatcgctatc cacataatgg aaggcatggg gtagggtgag ccaaatggaa 131100 
ggcttataaa ggcagcaagc aatgaggact agccatttat ttaaaatgga aaagggacaa 131160 
ggtatcttta tcccacttta tcccactgtc caacaaaatt attttaaaaa aaagagagag 131220 
agagagagaa gatgaagaaa ttaagagtgg gaaaatacta ctacaaccgt ttggcctttc 131280 
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tggcctctta aaatctgagg caggtcaggg aaaacatggc aagaacacta aacttaaaga 131340 
gtcaggaaat ggatgctcaa gtgttgttgc tgcctctgac tcagccttac tgtgtgacct 131400 
tgaggacaat atttccccaa actccacccc agagcactta ccgcttccct gtagataaat 1314 60 
ataaggaagt tagaacaagt ctctacagtt ctttccattc ctgacattct aggcttttat 131520 
aatgagctaa aacaagaagc ttgtttttta atgtgtgaca caccatgaaa tgtatacaga 131580 
gaagtgacct ataaattctg attaagtgtg aattagatgc aaatacaaaa tgcaagatta 131640 
gagtgaaaag cattacccct ctgtaggtca ggaagtataa tgtggtttct ttgcactatg 131700 
aataatgaat attgtacatg ccgaaatgag cactggaaaa catagaagga actagatgct 131760 
cttctagaat gggctttctt ccctttggga tagacttttt ttttttttcc tatttttgca 131820 
cccatgctta gagaggacac aaacatctct gagattctac ttgagacaat gcgacaagca 131880 
cttaaaggaa caccaaatca tttgagtgca caggcaatga caccatctgt agacttatct 131940 
gatccagaga acctgaggtg gagcctatag cctctggtca gttggaagcg gtggggaaat 132000 
ctcaagaaca cttgtcctga gagaaagaaa actttatctg cttgttacac atgagcctga 132060 
gctgaggaaa tagacctgga tccagggaat tcgtgtttac tgggtgccac aacttgtgtt 132120 
agcacttgac atgatttctt ctatttaaat attagactaa aaacactcta tcagactagg 132180 
gttttaaaat actatttcat ttttttggta cctatttata aaatggggaa actgagtctt 132240 
gggggatatt aaggtcttat agactgtaaa tggtataatc caaggttgaa aggtttcctg 132300 
actctaatgc tggttctctt tcacaatacc tcattgcctt ctggaagttg tagttcctac 132360 
cctagttctg cctttatttg gctgtggtaa tcaaaacaaa tcacttaagt tcttgaatct 132420 
cattttgttc ttctgttaaa tatcacaaaa ccaaccaacc aaccaaaaca gtgagaatat 132480 
agctttgtta tgcttccatg atttgtggat ggtcttcaca ggcaacaatt aattcatgag 132540 
gaatgatgcc cactgctgtc atgcaacata gtggccctgg attttaagga atccatgtat 132600 
taatacatgc aacctgaaat cacatatgta aattgtatta tatacttgca tatctatatt 132660 
gttttagaga agaggctaaa gctttcttga gaggctttct gtggcatcta tatccccaaa 132720 
ttcctaaaaa tcattaggtt cttggcagct cattaagtga ttagtaggtc tccttatgat 132780 
gtgttataac tcaaaacatc agtaaccatc tgaaagaaat taaggtttag gacaggcatg 132840 
gtggtacaca cctgtaatcc cagcatttgg ggaggtaaag atggcagtat cgcttgaggc 132900 
caagagttta agaacagcct gagcaacaca gtgagaccct atctttacaa aaaatttaaa 132960 
attgttatta aaagaaatta aagtttagta catgaaagca gctgaaactc agaactgacc 133020 
cttacatcag aaaccatgtg gtatcatgga aagaaatctg gcaaagaatc aaagattctg 133080 
ggtcctactt tactctgctg tgaacccact ccatgaatct gggcacatac ttgggccccc 133140 
attagtttgt cattcaaaaa gagatgctaa gttccagcct gcctctccca aagcactttt 133200 
taaaagaatc aaatgtgata atatggatga aggcactttg gcaaaatatg aagtaccctg 133260 
caaaagtctg atattaacca tgagatatta aagtatcaag tcatttcact agttgtcaac 133320 
tgagaaaaag ggaaaattgc aagttccatc agcaaaattt agaagccttg ctttttcatt 133380 
ccttcagcaa ggtcctacag ctgatattta tgcataaatt ttcttgactt taatgagaat 133440 
tggttgcaaa tacaccttac aggattcaaa tggagatcat caccattcta ggagctgcta 133500 
aacagaacat gtggcttctt ctctagccaa gagttctcct cttttatcac ctttatttta 133560 
tgatcagtgg ttctcaagga atggtcaagc accagcagcg tgagcatctt ctgggaatat 133620 
gttagaaatg aaaattctcg agccccatcc cagacctact aaatcagaaa tcctggaggt 133680 
ggagcccagc aggctgtgtt ttaacaaacc cttaagagga ttctgatgcc ctgcacactt 133740 
aagtgtgaga accactgcca taagtgagta tccttggaga gacctacttt ggtcctgggt 133800 
actttaagga aaatcgtggg gccccagtaa tccaaaagag tacctcatct aagtctctga 1338 60 
agggctgatg ttagagcaaa ggttgggcta gtgaatgtca atgttagcaa acatggtggg 133920 
tgtgacccaa aacataatca aataggcctc ttaggttaaa gtcctgatgt taggtttgct 133980 
ggttgagaag gaatacaaat gtatctcaag gaatgcagtt ctctcaagat tcagaaagta 134040 
tggatacctt tgccatgcct ggcagcttga aagaaatagc aatgtaaagt taaaccacgc 134100 
ctatgtgaaa gttagctcca tagcaggctt tcttctctga gatttgaatt tatggaacat 134160 
gataacaaat atgaaaatag ataattttta ttgaatatca tatcactacc actattttaa 134220 
gtgatttgta tgtattaatg gctaaccttt tttcaggtag ttactccacc aagctctttg 134280 
ctgggccctg aaaatgtaac agtgaacaag aaaaaaatct ctttcctcaa ggaactcaca 134340 
cttcaatgaa gggaaattag aagaaattag cctattactg ttcaagttca gggtgtttgc 134400 
agaggcctag atgcggcacc tacactcagt cttatgtaaa tagaggtttc tcaaaggaag 134460 
aaatgtataa atttagacct ctgtagagtg agaggagttg atcactttag agtggagaaa 134520 
cagatttagg ataactatac tctagatcac acagcaagta agtgatagtg ttgggatttg 134580 
aactcaggtt ggtttcatcc aaagccattg ctttcaacca ttaaggcaag ggcagaaaat 134640 
gggttttatc aagcctgtca attctgacaa attaataatg gcttccaaga atgtggatgg 134700 
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tgaatactcg gtgagcactg ggctcaattg 
gtcttccctg ggttcagaga ggagggtgta 
agagtactgc ctccaggact tagcacgaga 
gtaagagtgt ctatgacttc aaggaacctt 
atatgttagc aaaggtagca agtgacagcc 
aaagtttcct gtcaaaagag aagacactaa 
agcaattggt cttactaaga gactaaactt 
ttttacctct gttagttcta tcaacatgga 
aaataatggg atccatatgt attgagtcag 
taaaatgcca cccaaaatat atgcattcaa 
gggaagtgat atcaatcaga aggattaaga 
aattagtgcc agtttgcttc aatatgtttt 
ttgaagtctc gttgaatatg gttgtttctg 
gaaatcatac taatgacaat aactgctcct 
tcaagttagt cagtgcttgg aaatggcttc 
tcactttccc cagctcacct tcacatttaa 
taacagatgt ttttaaataa ctcagttatt 
caaaagtccc aaggtcagag ctatttgcca 
tgaatgatca cttctctcta accgtgactc 
agaggctcac cttgggttga gggtgacttt 
agggctgtgc ctgcttgtca tgcttctcta 
gaccagtttc tcactgatcc tcctcccctg 
tttccccctt gagacttgtt caccttgttt 
ggaatcaaaa catttctctg atcacgtcat 
ttgtacattt tacaggttat ttgaaacctc 
gtgtagaaga ggctctggcc acagaccgaa 
attagataat cacgtggaaa actgtaagaa 
ttagtatctt ctgtatgcag attccaacat 
ggactccagg tttgagtcca tgttaaatgg 
atttaatgct tttaatacat tcaggactca 
attcacggca gcagagacat accaagtcag 
attcatgtgc cgcagccctg gttggctgct 
ttttttagca atattgctgc tgaagcttag 
tttctgcctc aggtgtgtgg tctagttaat 
gaatgctggt tgtgaggaga tttgatttct 
ttgggcaagg ctctttcctg gtcccagccc 
aggttggtct ctaaggagcc tttctagccc 
ttctgggttc agttgagttc agtgccaccg 
caagcaccat acggggcact gcaggggata 
ggcactcacg ggcaaaggga gagtgggatg 
agagcagact atgttgtatg ctggaagcaa 
cagaaatttc atttatttat gaaaagtcag 
ttgaaagaga aaattcaaac aggatccagt 
atattatgtg ccagccaatg ttctgagcac 
agtcttttga gattaagact ctggttatct 
actggctaag gaataggctt aggctcatgc 
gcttagcgat ctggtgccta cgcccgtcac 
aaacaaggcc accgaacctg gagggaggga 
ctgggaaagt cttttcagcc tctgaccttc 
aatagtttcc tcgttgggtt atgatgattc 
aaatgattaa aagttctaga tgattttaaa 
caaatgctga gaggaggtaa agtgtgggac 
agggtctctt ctccacagtt tgagaggttc 
tccagctatg ccaggccttg actctgctct 
tgtgtgtgtg tgtctgtgtg tgtgtgtgtg 
tgttgtgctg ttgaagctga atgccagcac 
gcatttgaaa gaaagtatag acccttagct 



gaaaaaaaaa tgccatgatt aattaataat 134760 
tgtgtcatgc atttgcctac cctgcagtac 134820 
ggatgaaatc tgcggttgtt tcttattcat 134880 
agagctcaat ggcatcagca ggggcttatt 134 940 
caggatggag cattcagtaa aaaagagaat 135000 
ttaagtttaa cagtgaaaaa caaacaacaa 135060 
caaaatttgt aagccaattt cattttcatc 135120 
cggcgttaat atgacagaat atttgtttgg 135180 
cttcatcacg cccaggggaa acttaaattt 135240 
catgcataat ggctcatctg ttgaatagtt 135300 
agatggctgt ttaaggcaat gatgataata 135360 
atggcgtcag taagatggta ggtgagctct 135420 
cagcacattt gttaaaccca tatatggatt 135480 
atgactcaaa agggaaacaa atggatactg 135540 
tggatgaatt tcctttgcaa aaagtctctt 135600 
ccgtaataag cactctttac tctagacatt 135660 
gggtatataa aaagaagaag atgacctccc 135720 
tctgagcaat tgtccccagg aagaatgttg 135780 
agcacagcac accaacctgc acccattttc 135840 
gagtatatgg gcctcagcag tcaccgccca 135900 
tcaccccacc cacctgcagc catcagagag 135960 
atgcatttac atgagagatg ggggaggagc 136020 
tactttggaa gacaagattt tacagtacca 136080 
gctgaccagt gctaaattat ctctgattca 136140 
aacagggggg aaaaaaattg atagtattct 136200 
aaggacttta tctttactca tccctaccag 136260 
acatcttcaa catacaagaa acatgcatct 136320 
ggagaaagtg ttctagagcc taaggtttga 136380 
aaggaggaag agaaccattt aaaggtttgc 136440 
gtaacgtctc ctgtgcagag ctcctgatcc 136500 
cacagagaag atgccttggc tatacaattc 136560 
gaatataagt ccctagtaca tctctatttt 136620 
ctgtgtgctc tcgtgtgtcc catcctgctc 136680 
cctccattcc atgggagaaa catagcccag 136740 
actcctactt ctgccattaa ctgtatgact 136800 
agccccagct attcagcatc catttaggat 1368 60 
cagcattcaa ggacttagtg gaaactagaa 136920 
gcatttgccg actgacttcc tctttgtcat 136980 
tgtttatatc agagctctct ctgataccac 137040 
agaaaaacaa gtgtattgat ataccagtgc 137100 
agtacaaatg attatagggt ccaaaggaag 137160 
aataaaactt catggcattt cagatcagcc 137220 
gaagtttcta tagcaacatt tactgagcaa 137280 
tttgcacaca gtaacttatt tattcctctc 137340 
gattctgtat atgaggaaac tgaggcatag 137400 
cacaaacaag cacaagcaca aggattcaaa 137460 
ttagtgtgga gagtcaactg actgtcacat 137520 
gacctgactc tgcagatgag ctttgtgtcc 137580 
acgtttctta actctgaaat gcgaatgaaa 137640 
agcaaaatgg caggaataag aatagatttt 137700 
taggagagaa aagcagagga gctgattgag 137760 
atatttaggg cagagtgatt attcagtatt 137820 
ccacctgaag tcttgttcca tctcatctct 137880 
ctctatgtgg agaatgacaa atctagtctc 137940 
tgtgtgtgtg tgtgtgtgtg tgtgtgtatg 138000 
aattctcatc agcatttgag ttcattggca 138060 
ctcagtgcta atatagacat tggcacatta 138120 
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gtttcagctg 
acttctctgc 
catcttattg 
aatgggaaag 
gacagtagga 
gattggtcct 
gcaccttcgg 
cagctattct 
gaggtacaat 
tataaaacaa 
tacagggcca 
cattgggtgg 
taattaactt 
ctttatcctt 
gttccaggtg 
tttttttcag 
aaaagaatgg 
ggtctctact 
tgcatatgcc 
ttgcccagca 
aaaagttgta 
atcactcaga 
ctgagacaca 
aggcaggcta 
ttaggtgtag 
taagtgtgct 
ttctgactgg 
gagagataga 
tctatttgcc 
cagactgacc 
ttttcctctg 
acattacaac 
tattgtctgt 
ggctaattgg 
tgcatttgtc 
gtcttgcttt 
attgattcct 
ctttgttgtg 
tcagttagaa 
ctgctgagaa 
taccaatgag 
caggcagggg 
ggatggtttg 
aatgcctttc 
ctgaaacatg 
tgctgactcc 
tggaaacact 
tctgtgatga 
ctcctgctcc 
tgtgcagtgg 
cacttgggaa 
ccttggactt 
tctccctctt 
gcagaattta 
tcgcattgga 
ccacatacac 
tagacatgtt 



aattaactca 
atacagaaat 
gtactggaca 
cagaagggat 
aatacgatgg 
ggaaatgtca 
ttctgatagc 
tagtctctca 
tttccactgc 
taacaatata 
ggaactgtgc 
gtactattat 
gcccaaggga 
aaccactcag 
gcctttcagc 
aggaatcagt 
ggcaagcgat 
ggaggagaga 
cccacctccc 
attgttcaat 
tcatggtcgg 
aagcctttgt 
gctcctgtcc 
gcattgatgc 
tgtgaaggga 
aatagcaata 
gactctcaca 
gcagtgatga 
tgtgcttgct 
tggctttgat 
taaatggaga 
atgtttagta 
gctatgacta 
ccactgttag 
ctctcttcag 
tttgtttttg 
ttttataatt 
tgttcagatc 
ttctgtcgtg 
tcatcataga 
aacctccagc 
tgggcctgtg 
catgcccttt 
ctcctcctca 
cagctcgtgc 
ggggccttca 
ttaaactgtc 
gcgacattgt 
cctgaaacca 
tgactaggtt 
cacatgcaca 
ttaaaaacat 
taagggattg 
tttcattttc 
atgaccctct 
tatgttgaat 
gcaaggtagt 



tgtaggactg 
aacatctcag 
aagcaggatc 
gcagcgtggt 
acaagggtca 
tttctataaa 
ctaaaaatgc 
actagaagct 
atgttgctct 
gcctcagata 
aaaaacacgt 
ttccataata 
cgaagcaccg 
gtgtctagcc 
cccagaaatc 
ctttcttccc 
gctctctgta 
ggacagcagc 
cagttctctg 
gactctgacc 
tagaccagag 
tcccaaggca 
tcacaggaag 
aagatgctga 
ggatcagaca 
caagccaaga 
gaagaggtgg 
gattctgctc 
tttccgaagg 
ttctgtgtct 
ttctagtacc 
atgcaagtgt 
tttcagtgaa 
gcaattttaa 
atccttgtat 
atttgtttgt 
tcctcttcct 
cagtagtgag 
tctacagacc 
atatgggaac 
ccagggacag 
tactagtcct 
ctgcttttct 
tttcctcttc 
ttccacaagg 
cagcttgcag 
tcagcattga 
agagagttaa 
aggattataa 
gaagaggatt 
cacgcattta 
gcttgccctt 
cccttccctc 
tcatgcaatt 
tagaggcaag 
tgataaattt 
ttagcaaagt 



acgctaaggg 
cactgtctgt 
ttcagatggt 
taaatccaaa 
ggagaccagg 
ttagcacaac 
atgagttatt 
acatgtgatt 
tatctttctt 
actgcagcaa 
tttacatgca 
cagatgagca 
ttttgaatct 
cagagtagac 
tcccctcact 
cagtcagaat 
aagtgaattc 
gaggagagtg 
agagatattc 
atggaaagca 
acaacagaga 
ctattaagta 
gactcactgg 
gtgccataac 
tagttaattc 
actatgagag 
tgtctaagtg 
atcacagggg 
caggacggct 
gtcatttctg 
ttcttcatgg 
ttaagtacat 
cattatctca 
aagttaacga 
atgagcatct 
gggttttttt 
gttctaaagt 
tgcagaagga 
agctttgttc 
tgcaggcaac 
gaagaaaatt 
caagtgtgct 
tccgcagact 
cctccccctt 
catcctcagc 
cccagttgac 
agaaagtgag 
caaaaactgg 
atactttcaa 
tggaagcggt 
tgcatttgca 
taggatcctc 
tgaatgtgca 
tttattagtg 
ggtcttgact 
acttattcag 
gataaggact 



acagttagag 
gtagcaagat 
gcccagcgac 
aaaagagcag 
atttgagccc 
tgataaagat 
tttaagcaga 
ctgcatttct 
agtggccaga 
cagaaacaat 
tttgattctc 
agcaaaggtt 
ggcttgtcca 
acaggcacgt 
tgaaaaataa 
gaagtcacta 
taatgaggta 
gcatgggaag 
cacacaggaa 
tacgagaaaa 
aactggctgc 
tcaggcagac 
ataagatggg 
aaacgtctaa 
taacacgagg 
cccagcgaag 
ggaccttgaa 
agttggtgta 
acaagcatgg 
tgtgcccttg 
cattgtgatt 
gttcaataaa 
gtcttctggg 
aatgcaccta 
ttgcctattc 
tctatacaat 
cctaccttgg 
cagttaccat 
ctctcctagt 
tgcagccaaa 
gtcaagtttg 
gactcacagg 
taggctgtga 
cctccctgct 
atgcttttca 
agccagtgaa 
aagaaggcag 
gagactttca 
tcttacagga 
acttgggctc 
ctatgtagta 
atcttggaaa 
gaacatgctg 
gctcataagt 
gttgttttcc 
tgaatgtttt 
catgtggaat 



tggagtggaa 
gtattatcat 
tatagggact 
cagctcaaga 
cagcttaggt 
gctctctaaa 
aacttaacta 
tattaactga 
taatgttatt 
agtagtcatt 
agagcagcac 
tcagagttac 
ccatcaaacc 
gagcccaagt 
tctagaacaa 
actccagagt 
aaagtgggca 
tggagagagc 
ttcctgaccc 
agtacgccag 
aagtctcagc 
agagagattc 
aggcaaaccc 
gcagcttgct 
taggaagtgt 
gatggattaa 
ggattggaat 
aagaggaaag 
agctgaagtt 
aactttcata 
gggaacaatt 
tgatagtttt 
tagcatctga 
caacttcatc 
ttggatcata 
agagcaaatt 
atggtccact 
agtcagcctc 
caccactatg 
tcttttgttt 
tacagagaga 
ccagtgccct 
ggagggcaaa 
ttgggtaaag 
cagcaccctc 
gtgagaagat 
gtgggaaacg 
gggtttcctt 
caactctagc 
cgtgtacaca 
ggcatttctg 
tctgccctct 
ttcttctact 
gtgtctttca 
ccatactata 
ggtattataa 
cagacataca 



138180 
138240 
138300 
138360 
138420 
138480 
138540 
138600 
138660 
138720 
138780 
138840 
138900 
138960 
139020 
139080 
139140 
139200 
139260 
139320 
139380 
139440 
139500 
139560 
139620 
139680 
139740 
139800 
139860 
139920 
139980 
140040 
140100 
140160 
140220 
140280 
140340 
140400 
140460 
140520 
140580 
140640 
140700 
140760 
140820 
140880 
140940 
141000 
141060 
141120 
141180 
141240 
141300 
141360 
141420 
141480 
141540 



105 



CL001205 



ggggtttgaa 
gtccctcttt 
ttaaaagaat 
tatgcaagca 
atcatcatca 
atcccctttc 
aagcccaatg 
acccaagaaa 
ctatggggat 
atgctttgaa 
cctcccatgc 
gagattttat 
tagagggtgg 
gtagcagcat 
tctataatgc 
gagaaacaag 
gtccatgtct 
gtccactctc 
gtgttgtgca 
gtcctaaaag 
tcccttcaac 
tcatgcccct 
atttctgctc 
tttcaaacct 
gtgaactagt 
aaggctatcg 
aggttgggga 
gttcctgata 
tatatttttt 
taattaagac 
agcaggatga 
taggggagaa 
agcagcagat 
gcagattctc 
atgaggggct 
tggctgctag 
gggatgaatt 
agcagtggca 
catttgtgac 
aaacttacag 
agtcaggcag 
agcaacttaa 
acctcatatt 
ggtaagctta 
gtcacaggtg 
gggtcaggga 
gttcacagtc 
atgctaagac 
ggagtgaaaa 
gttttccagg 
ggaaatggtt 
atgaggctgg 
gagttatgca 
cttcccccag 
ctgtttcctc 
gggatataca 
tccggaattc 



ttgtagctcc 
tgttaaatgg 
tgatgaaagt 
cctattagta 
tcatcatttg 
ttttcttaga 
tttattactg 
gctactttcc 
cagaagaggc 
caaggcaggt 
tcaaagcagc 
ggctttagca 
aggagaagta 
ggctgtggtg 
agtgttctga 
tcctgttaat 
gttctcttac 
ttgtttactc 
ttagatagat 
gggaacatct 
catctctttg 
ggtgagtcaa 
atatcacctt 
gtttatattt 
gaaatatgag 
gatgaatcac 
aaagcatgga 
tcaaaatatt 
aaataatcta 
tttgagtaaa 
cctgcttaag 
gagttttggg 
gagggaaagg 
aaaggtcatg 
tgaggggatc 
atgaaggatg 
aaagtgacaa 
gagccctagg 
atctttagag 
ctaatttttg 
ctgggtttct 
ctcttggaag 
tgggttatgc 
ttccttcact 
agcagaccag 
caggccctga 
tgatgaagaa 
agaaagtaca 
gctttccact 
tatggagggt 
catatgcatc 
agagtgaatc 
aaagcaggaa 
tttcatagta 
cagaattaat 
aagggaacac 
tgttacttag 



actattatca 
caataataat 
aaaatgtgta 
attaatgtta 
ttgccactgt 
tatgtgaggg 
cgtggctgtt 
ctgtggccct 
caccccaaaa 
ccaagcttcc 
tctagacctg 
ccatctggcc 
atcatcctaa 
gtggtggttg 
gtgtgtgcag 
ggtaacgcca 
agatgaggaa 
ttttacttga 
atgctgagaa 
agataatcaa 
gattcaccac 
aagctcctgg 
tgctggtggc 
ctgtgttttt 
tgcaaaagaa 
taaaatcatt 
agaaaatcta 
gatcaataca 
attgccatcc 
gttctgtgaa 
gtggtccttg 
cagtgagaac 
gaaatgataa 
caaaggagtt 
tgtttgacag 
gatgagaacg 
aggcaagtga 
ccagggtgag 
ggacgtgcat 
aaggcatagg 
ttttctagtt 
ctcatctaga 
agtttaaata 
tgatcctcac 
tacctgttca 
gacttttgaa 
gacagatatg 
ttagaacctt 
ggagtcaaca 
catttttgag 
tatgtgggtg 
aggggcaggg 
aactagtaca 
tacatgttcc 
ttgaatctca 
tccaggatag 
caggagttgt 



gttctgtgac 
aaagtacacc 
gtctattttc 
gaaataataa 
ggctgtgcca 
agataagcaa 
gctatgctgt 
cacctattct 
caataggaaa 
cacctcagat 
cacagcaccc 
ttatctcctt 
aaccaacagg 
cttggtgtat 
aacctatgga 
tttggatctg 
acaaaagctc 
cattgattga 
atgagaaaga 
tgagctaata 
attctccaca 
ccagtagcct 
ataccagaat 
attatacttg 
agtctttttt 
gtaatattag 
aacttatatt 
tgtgcttcac 
acgttgcatc 
ggatacaaac 
caagataaca 
cgcatcgggg 
ggtttttcag 
gggactttat 
gatgtgattt 
gaaaaaggga 
taaagtgcaa 
aatggagatg 
ccatattttc 
tgatctgatg 
tgatatttat 
aaaggacaat 
aatagatggc 
ggcgcataag 
ataaaccgtt 
atagattaga 
aaaatgtgaa 
agaggaggaa 
tctggctggg 
aaggatatgc 
gggcctcagg 
aggctcctgc 
tttgcttcat 
tgtgccttat 
gcctttttcc 
tgagaaatgc 
ttcctgccat 



ccctgcctac 
ctcttaagct 
tagcattaaa 
tgataattat 
ggttctccag 
aggaatatat 
gggcttctat 
tggctgtctg 
taaaaaggaa 
cttcagagtg 
tcaaagaagc 
agctgtcttg 
tggccccagt 
gtgtggaagt 
ggaaatcaga 
gaaataactg 
ttcatttaag 
acctctcaat 
ggctgtgccc 
aatgattaga 
ctctctctaa 
cttctctgat 
caagaatcaa 
gacaatttaa 
ctgaaactaa 
gtatcagaga 
tattttataa 
agtgttttaa 
agttaagagg 
taaatgttgt 
agaagccagc 
gaggtgtggt 
gtcaacagcg 
tcaaagcatg 
ccattttcaa 
aagtgggcaa 
taagtctaca 
taaagtgggc 
tttagtcaat 
ggggagaaaa 
tggctgtgct 
aataatatcc 
ttacatgtat 
gcagatacta 
attgaacatc 
cccaqccctg 
attatagaat 
gaaatgaatt 
gtcttgaagg 
aaaagtatgc 
tgattggcaa 
tcatggatct 
tttcattctt 
actcccttct 
tggaactctt 
ctttttttct 
gttttgagaa 



ctatgtttta 
gttacagaga 
gtgctcagta 
catcatcatc 
atcctcaatg 
tgaagagaaa 
gagctggagg 
gatctcagca 
tttttgccca 
aagtcccctc 
aaatccagca 
aaggagacaa 
tgtaggctat 
gcaagggaag 
acttgagata 
tgtttgcata 
gagagaaata 
ggcaaagact 
tcaaaaattt 
tacagtgttg 
agactcccac 
caactcatat 
ttctgtttgt 
ttaaaatgat 
atacttttga 
gacgactcta 
gtgccttcaa 
gtttttttgg 
gcttcaactg 
gaccgagaac 
caagcaaaag 
atgttgcaga 
cttagacagt 
atgagaaagc 
aagagcttcc 
ggtggaggca 
gtgacaggtg 
attgtaaata 
ctgcatgacc 
tcctaccaaa 
actcagagca 
tgtgcagccc 
ctaggacttt 
tggtcctcac 
tactatgtca 
tcctacagaa 
atcaagatac 
ttttctggaa 
atgtgtagaa 
aggcctggaa 
actgttagga 
gtgagtgagt 
tcctcaaatg 
tccttgctta 
aattttagca 
ccaaatgtct 
actttgaaaa 



141600 
141660 
141720 
141780 
141840 
141900 
141960 
142020 
142080 
142140 
142200 
142260 
142320 
142380 
142440 
142500 
142560 
142620 
142680 
142740 
142800 
142860 
142920 
142980 
143040 
143100 
143160 
143220 
143280 
143340 
143400 
143460 
143520 
143580 
143640 
143700 
143760 
143820 
143880 
143940 
144000 
144060 
144120 
144180 
144240 
144300 
144360 
144420 
144480 
144540 
144600 
144660 
144720 
144780 
144840 
144900 
144960 
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gcgattcaaa cgttggtgca aagggaacta aatacttggt agtgaggagc atcttggcag 145020 
agaaggcagc tgagcttttg ctaaagtctc agtggttgaa attgtccagg aggcctgagt 145080 
tttggtccct gggctgggca gcatgtgctc ttggttgtca tgcctccatt cctccttctg 145140 
gaaaatagta aagaagaaaa tgtataagtt agagaggtga aagagtgtta gagctaatta 145200 
aatctcactt attttactta tgaggaaata actcccaaag aggagaaatg actaagttca 145260 
tataaagaac ctgcagctaa cttggtcatt tagtcatata aagaacctgc agcttacttg 145320 
gtcatttatg catttattcc attcagcaga catgtttcag tgcctcccat gtgccaggca 145380 
ctgtggtaga tgctaggaca atagtgataa gtaaataaac atgattcttg ccctaatcag 145440 
ggagatgaaa ttaaacaaat aaaccaaaag ttaaacaaaa taattaaaga tagagataag 145500 
gtatgatgga aagaaacaag gtaccgcaat agagaataaa gagtagggac cgggcacggt 145560 
ggctcactcc tgtaatccca gcattttggg aggccgaggc gggtggatca caaggtcagg 145620 
acatcgagac catcctggct aacacggtga aaccccatct ctactaaaaa tacaaaaaaa 145680 
attagccagt cgtgatggca ggcacctgta gtcccagcta ctcgggaggc tgaggcagga 145740 
gaatggcgtg aacccgggag gtggagcttg cagtgagccg agatcgcgcc actgcaatcc 145800 
agtctgggca acagagtgag actccatctc aaaaaaaaaa aagaaaaaaa aaaagagtag 145860 
gatgagtatt ttagagaagg cctctatgaa gagagaacat tttaagctga gatctgaaga 145920 
atgcgaaggg gtcagccact taaagagggt tggaagagct ttctaggaga taacagcttg 145980 
tgtaaaacct caaggcaaga aagtcgggac tgtcctgaaa ggtaagagaa agaccagcat 14 6040 
gaagcccagg cagaggtgtc atggcctgga agcagatgtg gtgggagaga taggcagggc 14 6100 
cagaactcag agcctcagag ttcaaagtgc ccttattggc agactcttga ttgatctcac 146160 
acccctgcgg aaaaggaaag gcaagtatac gtgcattgca agactctatt ggccgagcat 14 6220 
ttgtttcaaa tgtcgttttc gcagcatgct ttgctgagac catagattac aaaaacagaa 146280 
ataaaaatga ttcagctttt cttctgtgtc tacccctcaa cttaattagt gctcacagct 146340 
ccagaaatca gttgagggca ggggtcagga gtgaattcag atatagatga agtgggactt 14 6400 
agtttctggt gttgccactt ggttttaaag atgcagaact tcttatcttc catagcttta 146460 
ttttatcctt catttttgtc ctttcacttc ctcagtggat aaatcctagc tagaagatga 146520 
attttgctga tttagggcac cctcgggctg tcctctgtag ctttcagtaa agctttgtct 14 6580 
tgattgacag atgctgatca agttcatggg tatgcattac agtgtacgtt tgctggttgg 146640 
cctaggaaaa cccatttgca cgtatgactt tcataggaaa agaatggcaa ataagaaaca 14 6700 
aaagattttt tttctgccac tcacccaagg aacagaaatt aagacagcca aaggaagagt 146760 
ctgccttcat ttaacggatg atttacctgg tgttccttgc ggtagtggtt cattcgtgaa 14 6820 
ccagcagaag gtattttgtg actatgggga ttgcggagat gactgggcca ggaggggaag 14 6880 
ctgtaaagct aatctctccc acaacccact tctctttggc taatggcttt gcttttgttt 146940 
gttctgcttt ggctggttca tgtgctagtt cccaaaggct gcacaaaggc agagctaggt 147000 
agatgtactc cctaccaggg ctctttagtg aatcccacct cctagcccca gatgaggctg 147060 
agtgaacact cactccagcc ttggactgta gcctttactg tggcccctga acctaaccca 147120 
agaatgagac ttgttgaccc aaggctcaga ccccaagtct cagatccctt atttcaaaga 147180 
gagctcctca ccccagctcc agactgagcc cctggcccag cattcctggc tgagtcccca 147240 
cacgtggtag agccttggcc tcaaagggtc catccacaat gttgtgagtc ccttggtaca 147300 
tattcacttt ttgaatgaac ctctgaggac ccctgccctc tcaagactgg cgtctcctct 147360 
gccatctcat cagagtggct tttagccatg gagcagtgtc atagcctcct ctagtccaaa 147420 
gctggccggt gctaagaaga ggaagcctgg aactcagaac ctctgtcctt gtgccaacat 147480 
gttagttgca gtctatctgt gagcttcaca gtcccattga ctcttgtaca gaaatcagca 147540 
agcagtggtc ttcagtcaaa ctgggacctc aacaatcgac tttggtcctg tcagggttcc 147600 
tttcccttac tcttcttaag agcacccaga gtcaccatga aagaaccaca agtggaaaag 147660 
gaatacgtgg gctcagacgt gtgtccatcc ttgcatgacc cttggctcct catctgttca 147720 
tgggtttaaa ggtcatcctt cctgccctgc ctaccttatt gagtattcac aacagaaaat 147780 
gaatgtagga gattgaaaat tgtataatgc actctaagtt attattacct ctcccaaatt 147840 
gtcagcattt catccttgct atctagcagc agaatagtgt agggtaggag tttacaaact 147900 
acagcccaca ggccaaagcc agcccaaggc cagattttct aaataaagtt ttattgaaac 147960 
acagcaatgt aaatatgtaa actcatttgt ttacatattg tctttggctg ccttcatgct 148020 
ataaaagtag cactgagtag tgcagcagag accacatggc ccgcaaaacc taagatattt 148080 
actatttggc tttttgcaga aaaagtttgc ctatacctgg catagaggag caaagtgagg 14 814 0 
actctgaagt tggaatgcct gtgtttggat tctgaatctt ctacttaatg gctatatgac 148200 
cttgtgcaag ttaattagct tcacttttct ctctatgaaa taaaaataaa cattgtatcc 148260 
ccaccctgtg ctatcaaatt agcattaaat gagtaattca tgtaaggcac ttagcttagt 148320 
atttggcaca tagtgagtgc tcagtgttgg ctgttattat tatccaaaga atgaaagctc 148380 
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tggtttgtgt 
agtaggacag 
aagattaatt 
aaagagttat 
gaacatagta 
acttttctaa 
aaacatcatg 
tgcatattcc 
tctaggcttt 
ccatgactct 
gagtcactct 
gtgatgctaa 
tcttcagggt 
ggagactggg 
gagcacagct 
tgaagcacag 
tttagaggaa 
ctttatgagt 
tcaagggggc 
ataatataaa 
gtcaacttag 
atattcattc 
aataataata 
cacagtgtct 
tactcagtgt 
acatgatttt 
ggtttgatat 
tggtagctta 
caagcaaggc 
gaccctaaag 
ctgtcttata 
cttacaaatt 
cgcacacatt 
tagggaggtg 
gaaaggatct 
ggagcagggg 
acgtctgtgc 
ttaagtcaga 
ccctccattg 
tcttgtgaga 
cacccatgcc 
tatgtgatct 
tgtccacatg 
gcctctgttc 
ttcttagggt 
ctgtctaaaa 
ttcgcagcat 
taagctccaa 
cttgctgcct 
agaagaaaga 
gagtacaaaa 
tttagtttaa 
gttcacagac 
ttt ataaggg 
tatgtcagcc 
tct tagcctc 
atgcttaata 



tgatggactc 
aacattattt 
aatctccctg 
tgtgaagagt 
aacactcaat 
gtagtactag 
tgaggtagga 
aggcacatgt 
tatctaggcc 
atgactgttc 
atggaccatc 
ggagtttgcc 
tgagttctgt 
tgttgtaggt 
ctcattctca 
gggagtatta 
tatgaaacaa 
tattgttcat 
ttacatctca 
ataaaataat 
tatccagaga 
atgtccaaaa 
taagctttat 
agcccataga 
ttagatgtat 
gggctctaat 
gcccagcatg 
gtaacgtgga 
taatcttgaa 
agactgtcca 
tgaggcactt 
acagaaccta 
tttatgtaca 
taagggtctt 
gcaaagatcc 
catgagtgta 
tggaactcag 
tgttcctgtt 
tctctcatcc 
ccagctagga 
acacatgaag 
ctttctcctc 
ggccttctgt 
cattccttct 
tcatttctca 
tagtcctgcc 
ccaaaatggc 
gagagcagag 
agcatacact 
tggagagaag 
gttctattac 
actctgttat 
gtgcttcttt 
aagctccagt 
cacagggtga 
gtatgcagta 
ataaatacac 



tcaggggatg 
atatcccagc 
aacttcagct 
aagtgataaa 
aggtgatata 
cttctcagaa 
ggttctactt 
aaattaaata 
aatagtcctt 
ttaggtgaag 
ccggaaagat 
atatggggtg 
aatgggatgt 
ggaatacatt 
gttggcctaa 
cgtatcttcc 
gtaacagcac 
tatattagca 
attgcaggtg 
caattcattc 
ttaaaatagg 
atattgtgta 
gagggcaaag 
ggaaagacaa 
gtagttccta 
ttatgaaata 
catctcctct 
tagttgttct 
ctgtgtaaac 
tgaaatgctg 
tttcagctgg 
aactaataaa 
tttaaaagat 
tatgaaggag 
ttaaagaaag 
ggcccagagc 
tgaactcact 
cccattttct 
atggactgga 
ctgaccctct 
tgccggatac 
acctgcatct 
ttctcaagct 
gcctgagact 
gctccaatgc 
agtcactctc 
cttgtttatt 
acttatcttt 
acacattcag 
aaacagaaaa 
aggtacttca 
ccagagagga 
gcatttactt 
tgatcagtct 
gtctaaattc 
ccacatctca 
aaatatgacc 



gagtagcatg 
tctgccactc 
tcctcatgca 
atgcatgtga 
aatacaataa 
caaatagaca 
atcattaata 
cactttaaaa 
caagaccagg 
tgataggtca 
cctcttctca 
aaaatgctct 
aaatcttctt 
aaggctggtg 
gctaacaggg 
agacctgtgg 
cgtgcaattt 
gaaaaatgag 
aaaaggcata 
agtgcaagac 
taaatgactt 
gtcattatgg 
gcatcatctg 
ggactgagtt 
acacatttgt 
ctgcaatcag 
ctgttcacag 
gcttcccttg 
agttatcaca 
acatctagat 
ggcttcctca 
gcatgggtgt 
gggtggagac 
atggcatttg 
aaagcaggag 
ctacggccga 
gagcatccca 
agcttatcct 
cttgagctat 
taggccaaag 
cttagaagag 
ctactgcttt 
cttcagtctt 
gcttctttag 
tagtgacaca 
acatcatccc 
cacatgtcat 
ttatccagtg 
gaaacttgtt 
tttcaatctg 
ccaaaacaaa 
aaaatgcagc 
ctccacaatc 
gtttgccctg 
caggaccagt 
ttggctttca 
tgggaggatg 



gtttatggca 
accacatgaa 
tgaaatagga 
agtgccaaga 
taaataagaa 
gttggtagtg 
atgctctgga 
tggagttgtt 
tacctttctg 
aggagaacag 
taataacacg 
cattttgaac 
tcctggggag 
gggtttgagt 
agtgatcaag 
atgctagatg 
tcgttattcc 
gatttggaga 
aatacacaaa 
atgtcatgaa 
gcattcattc 
gaacccttca 
tcttatttac 
agacatagat 
ggacatttat 
acatttagta 
caccttaatt 
acccaggggc 
aatggctgga 
ttctagagct 
tatccttcca 
tagataaaaa 
tttcagttta 
agcttaactt 
gcaggaggcc 
gaaacaactg 
tgtctgtcgt 
actgagtcac 
gggatgttag 
tgatgctaga 
ctcagtttta 
cctcctttcc 
gttcctgagc 
cctttagcct 
gagaggtctc 
gttttatttt 
ctctctccct 
atgtataccc 
ggatgggtga 
taggacctca 
aaaagtttga 
ccagagagga 
atgtcaggca 
gccccctgcc 
gagcccagcc 
atgcctagta 
gcttttcttc 



caggccatgg 
tgactcggag 
aaaagaaatt 
atagaatttg 
gaagaatggt 
actactatgc 
caccaggttt 
tagacagctg 
tactaacaat 
catacacgtt 
tgggtacaca 
ccattcaaca 
aaacggctga 
gccagcccaa 
ttcagtggac 
ggcaacctaa 
ctacctagct 
gcctcttcct 
aagttacttg 
agaaagatgg 
atttattctc 
aagggatatg 
cactctattc 
ccttatttaa 
ggagctccta 
gacatctgtt 
ttcctctggc 
ggcccatgcc 
gcaatggtga 
gctgcaattt 
aaattctgtt 
attgataaga 
tagctaggag 
ttaaagaatg 
tgtgcaggct 
aaaaacagcc 
gccattacac 
aacaaccttc 
agaggaaagg 
gccctccagg 
ctcagagagc 
ccctatactg 
ctcattcaga 
ggctggctcc 
cctgaccatc 
ctttgtaata 
ctctagagcc 
ctagcaccag 
atgattaggc 
catccccagt 
gaaacactga 
gaaggggctt 
cctgaagatt 
cttggttaca 
ctgttctaat 
attaagaccc 
cacatctcct 



148440 
148500 
148560 
148620 
148680 
148740 
148800 
148860 
148920 
148980 
149040 
149100 
149160 
149220 
149280 
149340 
149400 
149460 
149520 
149580 
149640 
149700 
149760 
149820 
149880 
149940 
150000 
150060 
150120 
150180 
150240 
150300 
150360 
150420 
150480 
150540 
150600 
150660 
150720 
150780 
150840 
150900 
150960 
151020 
151080 
151140 
151200 
151260 
151320 
151380 
151440 
151500 
151560 
151620 
151680 
151740 
151800 
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tttgcagcca tcttccaact gggcatcagg 
aaatgaatcc aaaaccaaag tgaaatgcta 
tccagcccgg gcaaggccct tggcaaccta 
ccccgacccc agtcatactg gtgtttctgg 
ttggtgcttt ggataaagtg gtccctctat 
tgcccggcct ctgctaaagg acccagttaa 
atcctgtggc tgatcccttt attactctct 
actcactgaa cacttgctta agctctgtga 
gtggtacccc aattttctgg caatattagt 
actaaataaa tgataacatt gtatctagta 
tcctaagtaa atgtgatatt gtttaagggc 
tctttgcatg tcagcattat atctacgata 
ctcttattgt cctctgagtg atggaaatga 
taagatgatg aagaatgtga ttcccaccaa 
agtgcctgaa agtcccccca ttcacacata 
gtcctgtcac ctcattatct gctatttaca 
ccaaggagaa atgtcggtgt gagaggcctt 
aatgccagga gttccctccc tgggagacag 
tttattgata ctaagtggat ggtgccctaa 
aggtgaggat agaattggac agcccacgcc 
ggttcctgcc ctcctgcttc cccaagaggg 
attgttggac acgctctgct acagagtttc 
ctggcagggc tccctgaatg ctctctctat 
tctacccata tacagtactc tttatgcaca 
gattcctcct tatctttcag tgtcaaagct 
tccccaaagg atggggttag gtgtccctgc 
gactgaattg ctattgccta catagttgtc 
aaagtggttg gggatgggct ggtttaccac 
acaagtgcca gatactgggt gtcgaatggt 
gtgtgtatgt gtatagactt cccagggatg 
ttttatttgc ttatatttca atagattcta 
gggaagacag tgtgctacat gagaggattt 
gactatttca aacatccatg acatatcaca 
ctggtagagg tcatctctca acttgccttg 
gaaacccacc tcagggaatg aaaggagtta 
ggcacttggc aggtgttgaa gtatcagctc 
ccagtcacag tggctcacac ctgtaatccc 
gcttgagacc agcagttcaa gaccagcttg 
ttggagctgc agtgagccat gatggcacca 
ccctgtttca gaaaaaaaaa aaaacaaaga 
tcaagtgctt tctaaatggc aaatactgcg 
gcattaaccc agtacacaca gagttctgat 
cttgtcatat ctgacaatgt agtctccatc 
atctctctac tgggtaactt attcacatag 
actccaaatg catctattca gaagtcacaa 
ggctgaggca caactcctgc agagtagtga 
caggcaggag ggggacagtg caggatccca 
agagacttac ctgcttactg gatttgaaaa 
aataactaaa atattgtcga atcagtaaac 
atgtcagagt ctgaaagggc attaagcatt 
gaaagctgag gcctaggaga gcgtgggatt 
ataaaaccaa aaagagaatc gtccaggtct 
gaaccatgtt atctcttctg attgaatatt 
cctcacaact actaattata tttatcaaca 
cttgggccta tagggtctgt ctgtctgttt 
ctctctgctg ttgcttaggt ttggctgaag 
gcaactctgg gagccagggt tcactcaagt 



gcctggatga agaaagttct tgtgagtctc 151860 
cattgacaca tgacacaatg acctaacttc 151920 
gtccccaccc atctccaact ctgtgccata 151980 
agtttttctg ttgtgccctt ctcttttatc 152040 
ctagacacag gaacaaggtg cttaacactg 152100 
cactgactat aatagcagtt atgatcattt 152160 
caaatcaccc tttattttcc ttaatgtaaa 152220 
aggtagaaaa catgtctgtc ttgtgtaatt 152280 
ttctggtgcc aataaatatc tgttgaattt 152340 
aatgacagac accctggatg tgtctcagaa 152400 
tttgggatga tttatatcta ttatggtaat 152460 
caagagaata taaaaaggga tggacaggca 152520 
agcacatccc ttgatgttac aacgttagag 152580 
taaggataga attattccag gaggggaatg 152640 
cttttgatct ccctccccta aaatgaagct 152700 
atgaggctgc cacattccca gggacaggag 152760 
aaatgaagtg acaataaaaa aggagttgtt 152820 
ctgtcagggc cagtacccag ggctgggaga 152880 
ccaggagtga ttatcaaaga ggtttgagtg 152940 
atgcagtgtg tgggttttcc tgctctcact 153000 
aattcctggc ctcgtttttt acttactctg 153060 
ttggaggcag ggccctcaga atctgacact 153120 
ctctgccagg cctttgctgt tgctcttcct 153180 
tgtaactgta ccaaaacctc tctcctggct 153240 
ggcacttcct cttggaagct ttctttgagg 153300 
ctgctgaact catccttaac atagcatgta 153360 
agtctctccc attgtcaaag tgacaatccc 153420 
tgtatctcca ttgcctaaca tagtgtcttc 153480 
gggtggatgg atagatggac caaagaatag 15354 0 
agacaggaat acatcaaaca taccactcaa 153600 
atgtagtaat gcctctttat ctggcatcag 153660 
tccagaaact gaagctgacc catgtcccaa 153720 
gacattaagg actcaagctc tgtcgtcaga 153780 
tgctgtttct agacctcact tctccaattt 153840 
atgtagagaa atgtctttgg cacagtgtct 153900 
ttaattatta ctatcagaaa gcattacagg 153960 
agcactttaa gaggctgagg tgggaggatc 154020 
cgcaacataa tgagatccca actctacagt 154080 
ctgcactcca gcccaggtga cggagccaga 154140 
agaagaagag gaagaaagag aacaccatat 154200 
tttcatccca gcaacatgat ggaagtctgt 154260 
tctgaaaaat gccccatctc ttctctccta 154320 
tctggctccc tttctagtct gcctcctttc 154380 
caggttaaaa gttgtgtctt aacacttaag 154440 
aggtaaatct gttcaggaaa gtgagatatg 154500 
tctttgcaaa gaggtaagaa aagcaggctg 154560 
ggattttgtt acaagggaat ccaagaactc 154 620 
agtgcactgg gggttgctag aatttgacta 154680 
attttccaat caattagact agactctaga 154740 
atagagatca aacaatcatt tcacagatgg 154800 
tacttaaagt catacaagta ctctgtaatg 154860 
tctgacagct agcctagtac tttgtctaca 154920 
ggttgtgaca gattcaaagt agatataata 154980 
tttaaccaat gactgccttc tcccacaggc 155040 
ctgtctctct ctctttcttt tacttcctct 155100 
tgtagaaatg gtctgacagt caggcactga 155160 
aaatccaggc caggctacct ccaagccatg 155220 
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ctcttttagg gagtagtgcg taaatgcacc 
ttaaggtaag agatgtttgc agaaggggaa 
cctggcctcc tgtgtgtcta aggatttgtt 
catctcaccc tttcctactt cacttctaaa 
cccagaggcc caactaaatg tgtgaaacca 
cataagattt atatcctgta ggataatatt 
aaaaatactg tagagagtat attataatat 
acttcaccct ggatgatgga ttgtgcattt 
aatatgtgat gtatgcttcc tggtagagaa 
attataatag acatttaaag acagtgccta 
attagggtgg aggagaaaat agtgtgagtg 
gcaacccagg caggggagca actagttcag 
ttcagctcct cctcagcccc tgctgcaggt 
tttatttatc aaatatttat taagcatcta 
gcacaaagaa gcatgtccta attttcaggg 
tatacaaata aacaaagaaa attattttag 
tgatgtgata cagactgatg aggggtcact 
acagagaggt gagatttggg ctgaggcaca 
ggtaagggaa gagcattgca ggaaggaggg 
atgagcttgg agcccagagg aagagagtga 
agaagacggt tgtaagttag gtgaagtcag 
gtcagatgtt gagacttttc ctgtgtgcaa 
agaaagaaca atgtgttttg tttgttttgt 
tctgggtaga gaagtgattt tgtcactgaa 
acaccacctt cagataccag gagggctata 
ccagaatctc agtaactaga agtatgcaag 
tgcagatggt atgctggttt tcagtgggag 
aatctttaag agattatgac ctttaaagcc 
gactcacagt agacaattag gtaccactgg 
aaaagatttt caataacatt tcaaaagtat 
atttgacaaa atctgggata aaaatgggta 
gtgaagtatg aaactctaaa acagggacaa 
caaagaagac ttatttttta aggagcatct 
ggcaatggtc gcatactgta atttgagatc 
aacaggtgat atggtaaaaa gagtaaagga 
gaaaaaaata ataataccta agtcacaagg 
gtaanaagga ccacaaagca ctctaaaata 
ctcaccatcc accttggaag ggcaattttt 
cactggggca aagattatga actctcatcc 
ctgactgact tggagtttaa tcccatgacc 
aagtataatc aaccacaaac caggattggc 
tgtgggtatg ttctcacagc taagctatta 
tcagctagca agatgtacaa gcacaaagtg 
taagggctag actgggaaaa tcgacacttg 
tggattcttc ctgtagcacc tttttatgct 
aagatcgtta gattaagagt caagaatatc 
cttgaatatg ttcattaaat ttctgagcct 
tcatcccttt ctagcctcaa tagggtcatt 
tgtcttatat acatggctac tgtaattaat 
atctaataaa attggattta cacagatgaa 
gggtagttag atgggtagga tggatggaca 
atggatggat tggtttgttg gatgggtaga 
tgggttggat ggatgaacta gtggatggat 
agtcttctat tcattaatcc ttctcttctc 
tctatgccat ctttgggttt tatgaacgta 
gggccctcca cgtctgcttc attacgccga 
tccttcagct gcgccctgaa ctgcaggatg 



ctcttagcct tagaggagaa agcactctgt 155280 
aatgaggaga taatttgaag gggaaaatgt 155340 
gtgtctgcct gctcccattt cgtccacccc 155400 
gtcctagcaa gaaacaccat ccatgagggc 1554 60 
agtaggtgac atgagggaat agatctggat 155520 
ttattgcagc aaggggagag aagagatcct 155580 
tttaatgagg tttatgtgct ccccatatac 155640 
tgacaaatct atatttaaca ttgggagaat 155700 
atcatattta tggaaatctt tattctagtt 155760 
ttgtgggctg tggatagata gagtgtcctc 155820 
gcagaggata aatggctacc ttagaagaca 155880 
ggtgacccaa cagtcacatc tctgtttggg 155940 
catatgccct catcaagcac tcatcattca 156000 
ccatgtccca gacactctgc tggcttttgg 156060 
aattcacatt gactaaaaca gccacacata 156120 
attaattaaa gaatgggaaa taagacagag 156180 
atttagaaga ggtgatgaag caagatctat 156240 
actgctacaa aggagtgatg tttgcaaaga 156300 
gaacagctgg tgcagagtct cccatgcaga 156360 
aagccagtgt ggctacagca ggcagcaagg 156420 
aggagcaggc tattcatgct ctgtgtgatg 156480 
tgggaaaact ttgagtggat ttgcacaagg 156540 
ctttgttgtt taaataagat ctctctggct 156600 
aaccaaatgt cctaaagctc aaggccatga 156660 
gagtgggtca agagcaagaa ttgtgtgaaa 156720 
cagatgttaa tgatcacctg ccaggtttcc 156780 
taggacagat gggttatatg gttcagctgc 156840 
cattttctct agggactaac agaaaacgga 156900 
ttaattgcat tctttattgc agcatgggcc 156960 
tcaccaaaaa aaggtcttgg aggcaaatag 157020 
gaattcctca caatctttta tgcagctaat 157080 
agtatgcacc atttcccaaa tgtatttgac 157140 
tgtggggcta tagtttcatg aaaccttctg 157200 
ccatacacag tggcaatgtg gggttctgga 157260 
tctaagagca ttaacttttt tatcttaaaa 157320 
atgtgatgag gactaaatag cctaatatat 157380 
ctgttttcat gacaaggatg aatgactttc 157440 
aggcattgaa gatggccttg gggttggtct 157500 
tatatcatat agtaaaacaa agtttaggag 157560 
ttggtcttat tagcatcgta ttttaattag 157620 
aactgcattg ggattctaca gaatcggtcc 157680 
ctatggctga tacactgggg ggtctgaaaa 157740 
aacattagca gacaaatgaa cactgagagt 157800 
ttttcttaaa cgtgtattag cagcaatagc 157860 
ttgcctttaa ttgatagaca atgtgagggg 157920 
tgcccaagtc atatgtccta gctgtgctac 157980 
tggtttctcc cctttaaaaa aatgacccag 158040 
gtagagacca agtaaggtaa tagatgaaag 158100 
aaccatgaca acttctggct tctaatagat 158160 
tgaggcatcg aacaggtgaa tgtgtgagtg 158220 
gttggataga ctggtgtgtc gaaaggatga 158280 
tggatagatg acagttgggt ggataattga 158340 
tcattttgga gtcatctgac cactttttgc 158400 
ttgtagtctg ttcccagttc tccaaaggag 1584 60 
ggactgtcaa catgctgacc tccttttgtg 158520 
gctttcccgt tgatacatcc aatcagtttg 158580 
ccctcatcag catcattgac cattacaagt 158640 
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ggcagaaatt tgtctacatt tatgatgccg 
gacttttgag ggatggagag aaaattacca 
cctgaaaaat atgggaaaat tatccaaaat 
tcagtagctg acctctaatt cagtttggaa 
ccatgtcact ctacagctca ctaagctttc 
ggacctcaga ctttggagtc aggcagatct 
gctgtttgac tttggacaaa ctctctctgc 
aaattctcta tgccttcatt ttctcatgtc 
ataaaggttt tgtgaaactg aatgagatga 
aaaaactcaa aattctttct ctgttttaga 
aatcttcatc cttctctggg ctttctaatt 
ccaaatgttt tctatacttt ttcttctcta 
gaatgcactt cccttctcca atttacatct 
cctattcccc aattctgtgg atgggagcac 
actgtcctac cttctcattg tatactaaat 
taaacaaata aatgggtcac gcaagacctg 
gtggttgcct gcagggagtc ttagaaaact 
tttttcaggc agttaggcag gcgtcgttca 
aacaccctat tttctttgat tttctcttca 
gactgcgact attgcagggg tcacctggaa 
gtagctggag gcctggaagt aataaagcca 
tctctggggt cagcagaagc acccacactc 
gagctcagag ccctgccgtt tgtcagctgt 
aagcctcaat ttccacacta taaaataggg 
gtggattaaa accagtagaa cagtgccaga 
tcaaactcac caaacaactt gcactgaaaa 
caatctgcag atgattcttg aggccagata 
gtgttactga aagtcagaaa tgacatacat 
aatatctgtg aataattaat aatgacaccc 
agaacctgca aatatgtcta tgttacatgg 
aaggttgcta aactattgac ctttaatagg 
caatgtaatc acaaggtcct taaatgtggg 
tggtgcaatg tgagaaagac gaccaccatt 
ccaagaaatg caggtagcct ctagaagttg 
acctttagac accaacacag ccctgctgac 
atgcatctta cctcaagaac tgtaagatag 
tgtgataact tgttacagca gcagtaggaa 
aacaaagcag cattgagtaa tggcctctct 
gtttaattca ggacacattt tatttattta 
acagagtttc actcttgtca cccaggctgg 
acctctgcct cccaggttca agcaattctc 
caggtacctg ccaccatgcc tggctaattt 
catcatgttt gccaggctgg tctcaaactc 
tcccaaagtg ctgggattac aggcgtgagc 
aaacacctac tgtgtgagag acaaagtatt 
caaatttgat cagcagttat taggtgacca 
cagttactga gtgccgaata aaactaacag 
ggctctgcta tgtatcaaac attctcactt 
ttatttacac cacactttac ctttgaagaa 
acaaggccat ttagccatat gttggatttg 
taccatctat gttcagaaac acagtgtctg 
agaaaactag aactgagttt ggcagaagag 
gcctgatggg agtgggtggt gcccacaaat 
aactttaggc ttatccgtcc tgcagaaagt 
ggtgacagca gtcaacatct tgacaaccac 
cctggagaag aaaaaggagc ggctggtggt 
tatcttgggc caggtagtga aagcagcaag 



accggggtaa gccaagggtt aggggaggga 158700 
gcagaaagga gaatgatgcc tcactgcttc 158760 
gctttatttc tgagaaatct gactgattta 158820 
acgcattctg agaataattc tcttaatgca 158880 
atggctcctg agtccaaact caatgcaaaa 158940 
ggatctgaaa cctaggttct gtcctctcca 159000 
ctcaatttca gcacttatat aatggggata 159060 
tataatgagg ataaaaacaa tgtctatctt 159120 
ctatgaagtg ttcagtgcct gcaacatagt 159180 
tgttatcatc atcatcatcc tcctcatcat 159240 
cttcctcctt tgaatttaac atcacatttt 159300 
tgtctttact catgttattc cagttttctg 159360 
gtaaatcctt ctcctctttt gcaggatgtc 159420 
tctctccaac tcccatgatg tttgtcttgt 159480 
agtcttgtaa taatttattc actactggtg 159540 
aaaggtgaga cagaggagta cagtactgat 159600 
tttgagttgg agagaggttt gtgaaagtga 159660 
gatgggaagg agccaacaga agaaaccagg 159720 
ctgtccccac tttgatgact tcccttctca 159780 
gtctatttca ggacagcttt gcagcataat 159840 
gaaaaatgta cctttgcttg tgggagatca 159900 
tcaaaagagc tccacctacc tcagaactca 159960 
gtgaccttgg actaatgact aaatctctct 160020 
ataataatag tatctacctt gaaagattaa 160080 
cacacagtac attttcagta aatgttggtc 160140 
ggaaacaaag ataggaaggc atagcagaac 160200 
ccatattggt cactgtggaa gaggttaact 160260 
acaaaaaatt aatccatgca agatcaaaat 160320 
cccccccaag atgttcatgt tgcaatctct 160380 
caaaggaaaa tgaaggctgc agttgggatt 160440 
gtaattatcc tggattatgt tggggggaca 160500 
aagttggagt cagaagagtc agtgtcagag 160560 
actggccttg aagatggaag gggccatatg 160620 
gaaaggcaag agagtggatt cttccctaga 160680 
accttgattt tagcctggga cacctatctc 160740 
tacatttgtg ttgttttaag ccagtaagtg 160800 
actaatacac catctaattc tccaagaaga 160860 
actcatttag ctatagatca gaagagagta 160920 
tttatttatt tagttagtta gttagttgag 160980 
agtgcaatgg cacgatgtgg gctcactgca 161040 
ctgcctcagc ctaccaggta gctgggatta 161100 
ttgtaatttt ttttagtaga gatggggttt 161160 
ctgacctcag gtgatccacc cacctcgacc 161220 
cactgggcct ggcctgccag gacacatatt 161280 
aggtattaaa gatactatat tgaacattca 161340 
tttagatgct aggttggcac atttaataag 161400 
ctaatattta taaaaccctt gcaggtgtca 1614 60 
aatcctcata atgacattat gaggtgagca 161520 
actaaagttt gaagaggcta atgagcctgc 161580 
agcctcagat tgttttttaa tcactagttc 161640 
atctcacaga gttcagagtt ttgcagccag 161700 
tgggtttggg ggtagccagg ggaggtaggt 161760 
cctgactgtc atttctcttc tactcatctg 161820 
cctggataca gctgctgaga agaactggca 161880 
agaggaggga taccggatgc tctttcagga 161940 
ggtggactgt gaatcagaac gcctcaatgc 162000 
ggctcagggt gggtgcggga ggtgattcag 162060 
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gaatagccag acacactttt gccttgggtg ttataaagag ggttataaag agggttcttg 162120 
actaggtgag actaaaagac ctctatctca ttttctataa ttcacaaaat ttaattctga 162180 
aatagcacaa acaatgggag ccttgacata gggcttcaaa tggttctcag acctgttaac 162240 
tccaatgtat ccctctattg tttaaaaaaa aaaaatgctg gatgcagtgg ctcaggcctg 162300 
taatctcagc actttgggag actgaggcag gcggatcacc tgaggtcagg agttcaagac 162360 
cagcctagcc aacatggtga aaccctgtct ccactaaaaa aaaaaaaaaa aaaaaaaaat 162420 
ttagcttggt gtcgtggcat gtacccataa tcccagctac tcaggagggt gagacagtga 162480 
ggcaggagaa tcccttgaac ctgggaggtg gaggttgcag tgagccaaga ttgtgctact 162540 
gcactccagc ctaggcaaca gagcaagact ccatctcgaa aaataataaa aaataataat 162600 
gataaattac attccatttt agagtttatg aaatgttatc atattaggaa cacagcttat 162660 
agggccaaat ttctgggttc gactctggac tctgtcactc atgatccagg gtttggcatc 162720 
tcattccttt caccttctta actgttttga aaggctaagg aagcagcagc cacctttatt 162780 
tccattgttt ccctgaacat gcccttccct ttgagaaatc aattgacagt acattcttct 162840 
ataaacaggc attgtaactt caacagtctt gagctagatc tctctaaact gtgcttcccc 162900 
tggtctcaaa acaaaagccc atgcccacca cagaattagg attttttgta gattccttat 162960 
ctttttaaac tacatttatg tattcagatt cagagctagc aaggtagcag acgttgcaat 163020 
aacttttgaa caatgtcata caataaaatt gatagtttta ggtacttgac aacatgacaa 163080 
agtttgttaa tggccttcaa gacccctggg gccctcacag agaatcttgg tactaggaca 163140 
gctattactc agaggcctca gcaatagaac acagacagag tttgaatgtt atggattctt 163200 
taaggcacta actgttctac tttatgaagt acaaatttgg ccttccctct tggaaagaga 163260 
aacccagcaa aacttgttcc attgagatgg aagttctctg ggccggagtc agggatttga 163320 
attgcagttt tcaattcagg caggataaga taatagttaa gagcagagat aaaaccagac 163380 
tggctagggt tcaaatccca gctctagtat ttactggctt tgtgaccttg accaggtttt 163440 
taatctttct acatctcaag ttcttgatct gtaaaatggg catcatcatc ataataatag 163500 
ccacttttag agggctcttg tgagaattaa aagaatcaat atatgtgaag cacttagaac 163560 
agagtctgcc ccacagtaag tggtacaagc aagcatttgc ctatttttgc tattcctagt 163620 
gcctactagg accagcttct tgagtcactt tcttccttgg gcataagtga atccgtctgt 163680 
aaaatgtgaa tgaaggcctc tgttctataa aactgataaa aaggcaagat aacatagtcg 163740 
atggaaaaag tgccaaatga taatgacata ttaatagaca gatgacctat tgtatattgt 163800 
tctatggtac agacagtagc tattactact atttttttag tgtgtatggt ataccagata 163860 
ctatgccaag accttcccat gcataatcct acttcattcc cacatcaact ttgtaaaaca 163920 
gcttttttca attcccctat tttatagatg aagaaaagag agattatact tgcccaacat 163980 
ctcacagcta gcatatgata aaagcaggaa gtcaaaccta tcttggtctg gccaaagaac 164040 
ctagcttctt aaaaatgcaa cttttctgct tctgctatct ataattaact cctctctcaa 164100 
agggcaaact tgcgttttgt aaagaagtga cccgtttgta tgggtatttt agattgtagc 164160 
atattgacat tttttaaaat tgtttcattc atctattttg aaaagtaaga ccaaagagtt 164220 
gaaaggattt tacagatcac ttgattcttt gcttatgctt acattttata gatagaatga 164280 
ctgacagtca aattgattaa gttatttgcc ttagacaaat aatgaatgac agtaatgagc 164340 
ctacaactta tatctctgac ttacaatctc acacttagca tcatcctgga ctatctcaca 164400 
ttcagtaact gggaacgtac ctgggataca gttacattca gtaactgcat acagtctaac 1644 60 
acaaaatttt ccacctaatc tgaggaaaaa aacagtgact caaaattcca aatagggaat 164520 
taactgcctg tgtctgtctc cagcccccac ccaactaaaa tctaagctcg acgatgccag 164580 
ggatttttgt tttattttct tccctgctga gtccacaatg ccttggatag tgcttgacac 164640 
caaataaagt attcaataca ttcttttcag tgaattaaaa gaataaggaa tcaatgaagc 164700 
ctctaagaaa gttcttaaat gtaagataaa tctggatttc tatagatagc cattggctgg 164760 
ataagagcct ttaagatgga ggaaaaggca tgtcaggagg cccagagggc atgtgagggc 164 820 
tgggtttcat actgcagttt tgcaggctga gtgatggatg ctgagcgtcc actggagagg 164880 
atggcaggaa gataaacaca caatcaagaa tatccttgga aaccattaga aagagtttag 164940 
ggtttattct gtaaacaatg gggagtcagt gaaggtttgt gagcaaggaa caggattatt 165000 
agaatataat actttatcaa tatatgatgc caaagcatat aaaacagtca ttaagcacaa 165060 
aattagttag aagtgagaat agaccaggac ctgttgaggt tatccatcac atccccctat 165120 
gttattgtct tcatggtgct tgtacctgct gaaatcttgt tcatttatct atttatatat 165180 
tgaccatctg gcaccctact aaaatgttaa ctcaactcaa tgagggcaca gagaccttgc 165240 
ctcacttgct caatgctgtt tccccagcac atagcatagt gcttggcata taatttatgt 165300 
taaataaatg tttgttgaat ggattaatta gttaatcata gagggtttta cagaggagtt 165360 
gagattggag ctcagccatt aaggatacga acaatcagag agaatgggag aggccattca 165420 
gacacaaata ttgtgaatga aggcaatgca ttgggaatga gctttgaagt aatgggggtc 165480 
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agtgaggtga tggacccctt gtgaaattgc ttcttacatt tataacacct ttctcagttt 165540 
gttaccctgt atatatttat ccgttcacct atttattatg gatctcctct caatgaagtg 165600 
taacctccaa gagggcaagc accatgcttc tttcaccttt ccactgtgtt gccaatagtg 165660 
cttcacccat agcaagctct taatacatat ttgtgaaagg aaggtattag aggagggaag 165720 
gtcaatgtat aggttgggga atagatagat gggaaggttg gtatcaggtt ggatcatatg 165780 
aaattgccag tttttataag tcaaaagtca tcagttatca gctgtttcct ttgtttcaat 165840 
ttgtactttc atcttgtggg gacaatattt ctccattgcc tgaatgtggt tggttttaaa 165900 
atatcatgtc tttgctgttt tgtgtttatt agaagctgaa taggttcttg ataaaggtgt 165960 
ctcattgttg aaaaggcagc aaaccatttg acaagcagat gaaaaagtaa tgaagaatta 166020 
ttaataaaac tatcatccca atagtatttg agctccgtgg tctggaattt cacaatctaa 166080 
ccactatttc aaagaattca ttgtgtgtgc ttgcatttaa ttattcccca cataaaaaaa 166140 
catgagtcac tatgcacata catccaatta agtagatgtc tgaatgtgga ttttaataac 166200 
cttgaaaatt tattcagtta gagaagaagc ccaagtgctt atcatctgtt atatgtctcc 166260 
ttgcaacagt gcagggagag agacaccgag cagagtacac taaccccaat ttgatgatta 166320 
aaaaaaatag ggaattagat ctagttaatc tgcaggctga gctagatcta aaaccaaggt 166380 
ctgtagtctt aaccattcat tattaggtgt gtctcctgcc caagcaaact tctgggcagt 166440 
tctttccaca aatatttttt ctacctgtga tgattaggga tagtctgcat tttgggataa 166500 
aagggtattc attgacactg gttcttaatg attaaccata atcaaacatc aaatggaaaa 166560 
gttcacagga gtcaggtccc tgtcctgata ctgccaccag gctactatca gatctgggca 166620 
aaacattttc cctctgtggg tcttgacttc ctatctgtac agtgagagtt tggaccagat 166680 
gatcctaact ttcccttcca gcttctaagt tgggcaatcc attaatgtct gagtacctaa 166740 
ctgattcggg gaggaaggca caaaattcag aaaaagaaag aagagcatct atatatggtt 166800 
catgcaagga attgaaggtg aagtcatcca catacccatc tgaccattca tccaccatcc 1668 60 
atctattaac tcatctgttt ttccaatcag cgaatattgt catgacagat ttcatgcaac 166920 
ataaaattcc cacctaaaga agaatctcca catacaagga aggtttatga aagaaagaac 166980 
gcagtgagca tttcagatgg gtccttgggg ttacagtaac accactctac ccttggaatc 167040 
agacaaaact gatttcaaat tctgtatctg ccacttagta gctgtaagat tacaacttac 167100 
agaggttata taataggcag tccatgccta ttccttattt gtaaaataag gatgctaaca 167160 
atgccaccat cattggatgg ttgggatatt gtagttctag tacttagggt agggcacatt 167220 
gtaagcactc gatatcagct gccgttatta ctgttgttgt cgtggcttta actgttagta 167280 
tgattaatga gtctccacct attatgtttt gtagattata aagctagaga agaatggcat 167340 
cggctaccac tacattcttg caaatctggt gagtagagca ctgcaggctc tcagctcaag 167400 
tcctttccag gtttggggcc ctaccttgct tctgttgtcc ctggctgatg tgaactgagt 1674 60 
gggtggaagg ggcaattcag ggctgtaata atgagtcttg gcaatactac atttttatct 167520 
tctccacatc ccactcatca aaccacaaca cactattcat gaacctctaa ccttccttgg 167580 
aagagagcat actggtgggc acggctttat catcttcaca attccagctt taattggctc 167640 
tgccccttgg caatagggac aaaatcacca catctttgaa ttaatgcaag atgtcttgat 167700 
cttactggtt tgggtcctcc attttgaccc cagtgatagt tttacttctt ataaatacat 1677 60 
ttacatcata attacactag tattgacaaa attttaacca gaaacaactt cccagcatga 167820 
acaatattat gtccatttag tttatattca cttttctata tggcatagat aagtttgttt 167880 
taaaatatat atatatatat atttgtttta atgtactaca gatcattgta ttgagtgcta 167940 
ctgtttttta tattacaaag aaaacaactc atcaaatcat tttttgtata tattaaattt 168000 
tacctagtag cacatactat tatgagtttt cttttgtaac cattatttct gtcaagtcta 168060 
ttttttctgg gtcatgtttt gtacctccac cttttttgtc atgaagttta attcttctag 168120 
cttttctatc caaagccaaa tgtattttta tttggtgttt agccccattt ttaaaggctt 168180 
tattgagaca tattttataa agcacacaat tcatccattt aaagtgtaca attaaatggt 168240 
tttcaatata ttcacaaagt tatgaaacca tcaccacagt cttattttaa aattcttcat 168300 
cactgctagg agaaatccct tacctattag caatgactcc ttatcatccc ctaagctcct 168360 
ccagccctag gcaaccatta atttactttc tatccgtatg tatttgcctg tttggagcat 168420 
gtcatatatg tgaaatcata caatatgtgg tcatttgtaa ttgacttctt tcacttagca 168480 
taacattttc aagtttcacc catgttgcag catgtatcag tacttgattc ctttttattg 168540 
ccaaacaata tttcattatg agcatatacc acattttctt tatccatttt tcagttcatg 168600 
cacatttggg ttgttttcac tttttgacta ttatgaataa tgctgctata aaatattagt 168660 
gtacaagatt ttcactttaa tatataccta ggcatggaat tactgggtta tatgataact 168720 
ctatgtttaa cattttgagg aactccaaaa ttattttcca aagtggctgc atcacccatc 168780 
aaattttaat caataacaaa tttttccatg tgacagtttg aatgatggca agtgtctctt 168840 
aatgataact atttctacag tttattatcc tgttctgtct ttgcaatttt ccatgtttcc 168900 
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attaagaagt 
ctgaacatat 
caatagtaat 
tagaaataag 
ataaattctg 
ttctttcccc 
agaagggctt 
tttctcagag 
agagtctcac 
agcccttaac 
tccacttaaa 
ggtcacacag 
aggccttttc 
aagaatacac 
aattacctct 
ccccaggtaa 
caccaagtca 
taatgacctc 
caaataaaaa 
attaattcag 
ataatgcaat 
gcatagaact 
gaccaaggaa 
tctttctgtc 
atcacttcgg 
gaggagggga 
acaaaacaaa 
cccttttagc 
aattgagaac 
ttgccaaatg 
cttcacatat 
atatatatgc 
agcattggtg 
atctctcaca 
tcctccaaac 
agacatctgg 
gcaaatctta 
ttctgtagag 
attgccaata 
gaattttact 
agaatttaca 
ttgaccagtg 
atctctctca 
ccgggttttg 
ttttacagat 
caggaaccta 
cataagaata 
ttcctt aaac 
ctgatt gcta 
caatagccct 
cttttccaca 
gaatcccaaa 
ggaaagagtc 
tctacagtag 
tctgtctcac 
agcattcact 
cacaggtcct 



ggtttctagc 

aactggcttc 

tgaacaaagg 

gcacagggtg 

tccttctttc 

ttcttcaact 

agagaagtag 

gtatttgcat 

cctccaaaga 

atcccttgac 

cctcacttgt 

tgaggcagca 

tataactcca 

agaacaatac 

ctggggtgag 

ttacattgcc 

gcctcttggg 

tcaaatgctt 

agacttgcat 

tttgaaacat 

tcataagacg 

gacccactcc 

ggctttgaaa 

cctgtcggaa 

aggaatgctt 

ggagattgta 

acaaaaacaa 

tggctctgta 

aataattcct 

caccatgctt 

taaatccgcc 

ttgcatatat 

tgggcataag 

gatgtgcctg 

cctaacactg 

cagccagagt 

gaccaaatga 

ccagtttgga 

tgagtaccta 

ggaatgaaag 

cagggttcag 

tgaacaatat 

tttttatacc 

gatgtcataa 

gtgggaactg 

attaatggag 

agatgatcag 

ccctaataaa 

cattctagac 

tgaggttgat 

gtcacaaaac 

atttgtgttc 

aggaaagaga 

cagcaataga 

catttactcc 

tgttagtgcc 

ctgttttgag 



tttttatcta 

agttttaacc 

attaatattt 

catgaatcta 

tgacctggta 

aacttgtctt 

gttaatttag 

tttaaatgag 

ccatttgcaa 

tcccagatcc 

ttatagggaa 

ctagagctag 

ggtcatctct 

atagaacaag 

atagaaattg 

agcctccatc 

ttgttgtatc 

acaactgcca 

gtttcgatag 

agtcgcagta 

ataattttta 

ttcggaatgt 

gatttaacaa 

tgcatgagct 

ctggggttca 

ggttagagaa 

acaaacaaaa 

ttcttggaca 

actctaccta 

taaaaagtta 

catctgtaca 

atccatattt 

atgaacttga 

ggtcctggaa 

actgtgcatt 

ccttcagagt 

gtgtttctca 

aaatgtggca 

aaccatggag 

cacatatttt 

cagactcctg 

ggtgccttct 

atattatagt 

tactcctatg 

aggctgagac 

tttggctgtc 

gctaacctga 

ttacaacaat 

actatgataa 

attatcctca 

atataaagtg 

ttattaccgt 

gggggttgtc 

tctgacattc 

atggtttagg 

gcattttcca 

aaacactcat 



taacgaattt 

atgttatttt 

gttactcatt 

aaaatgtgac 

tccagcctta 

ggcaatattt 

gcccagctgt 

ggtggctctc 

aaatggaatt 

ttagtgctag 

acaaccagca 

gcctgtcacc 

cagtgaaact 

aatacacaga 

catgccagtg 

tgttgccaag 

cttctgatct 

gctaatagga 

attagttgat 

tgtggtgtgt 

gaatttctgc 

ttgtgattag 

aaggagggag 

tcaaagtagc 

aatagccacc 

gacaacttat 

aaaaaaaaac 

actctccaat 

ctcatagtgt 

aaagtgttct 

tattcacata 

tataggagag 

accatgcaga 

cacacaagtg 

cactccctgc 

tgtagttctt 

aaacagagga 

ccaacaagcg 

taaatggtca 

aaaattctca 

tctgaaaaat 

agaggagtaa 

ttatgtgata 

aagtaaacaa 

attacatgac 

aggaatatga 

tgctgagaca 

aataacaatg 

acacttaaat 

gatgaagaaa 

atggagctgc 

gccatgtgag 

ctggacaaga 

caacccaggc 

tactcatatt 

aactggctct 

ttggtctaag 



tcaggtttta 

gtgttgatta 

tctggagtaa 

caaagcatcc 

gatagatccc 

cggagtgagg 

taatggctaa 

cagacacttt 

tttgacttcg 

agcggactta 

aaggggtgtg 

aagggtagca 

ttgtaggcac 

acaagggtgg 

gccttagaga 

taacccctca 

gcatttttct 

tgaaaattgc 

aattaggctt 

tttcttcccc 

cacataatta 

ggtgtaatta 

ctacactttt 

cagcagcatt 

tttgaagggc 

aggcaaggag 

cttgtttttt 

cctcagtttc 

cactggggag 

gcacaaatgg 

tggtgtgtat 

atatgtatgt 

aattggaaag 

ctgagcctgg 

tctgagactg 

atctttgtga 

cctgtggacc 

aatgcttcag 

gattgttata 

tagagaacta 

attgaatttg 

taacaatagt 

tttctgtatg 

ggtggaattc 

tgagctgcag 

caaaaaccaa 

cttgagtcac 

gcaataactt 

gcatgtcaat 

cagagcttca 

acttctcctc 

tgctgggcag 

cagcttgagt 

tctcaaattc 

ggcaatcgca 

acagacacac 

atttgcttct 



acttacttaa 

tctctatgag 

agtctgttag 

tgcctaatcc 

tgccttactc 

acttgaaata 

atatattacc 

tggggaacga 

gagtgaggaa 

ggaagtcaac 

acttggccaa 

ggcagtattc 

cctgctgctt 

ggataggcat 

cactgaaaga 

gtcacagcat 

tcttctgact 

atataccatc 

caaattacta 

cttcagagga 

gcctactgct 

tagagttaga 

attttacagt 

cttttagcag 

tgccggaaga 

tcaaaacaaa 

gtccagatgc 

ctcatctgtc 

accaaatatg 

aaaattctta 

gtacatatgc 

ttatttccat 

ctatgccact 

ctgccagtgg 

tggggagtgc 

ctcaccagaa 

tctgtagtgt 

tgccactgca 

ctggtattta 

gaagattggg 

ttggacaaca 

aatatcaaca 

aatcatacta 

atcatcccca 

aggagaaaga 

gagatgtggg 

tgaattagaa 

ccacaattga 

taaaccttcc 

gaaaggttga 

aagggccatg 

ggagagttgg 

ttaaatcttg 

ccattggaca 

gtggcactga 

cacagaggtc 

ggtgagtcac 



168960 

169020 

169080 

169140 

169200 

169260 

169320 

169380 

169440 

169500 

169560 

169620 

169680 

169740 

169800 

169860 

169920 

169980 

170040 

170100 

170160 

170220 

170280 

170340 

170400 

170460 

170520 

170580 

170640 

170700 

170760 

170820 

170880 

170940 

171000 

171060 

171120 

171180 

171240 

171300 

171360 

171420 

171480 

171540 

171600 

171660 

171720 

171780 

171840 

171900 

171960 

172020 

172080 

172140 

172200 

172260 

172320 
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aaaggtaaga actacaacta cattgatgga 
cagaagctga ggaagagatg acggagttag 
aaccaggaca gttggccatt tctgccatgt 
attggaagtt ttagtctatt tggcagctat 
tctgcgggat cagccctgag cagaaactgt 
aaatctgtca gctccttctg caaagctgag 
tgttggctcc gtacctcaca gttatgttta 
atagtagttg ggaagggtgg ctttggagcc 
ttattagcta tgtaacatta gtcaatttat 
tttaaaaaag agaatgatat tatgtacatt 
tccctcaacg ctctcggaaa tgaaatcaca 
atattttttc acttactata cttgagatta 
cttccctgtt tatctacaag ctcctagagg 
attcccaggg cctggcgaag tgagttgcac 
tgaatgaatg aatgaatgaa taaatgaatt 
aaatgtactt aaaggcaaaa tcttttagca 
atatgaattt agttatgaat atgcaaataa 
aacccacttc atgtagagag ccagaatctc 
atcaggttca tgcttgaata atacccagtg 
ttttcttctc aaagggctca gacagtttct 
gtgctgctca ccctgtgtca ctctctcatc 
caaaacttac ctgtgccttc ccacaaagca 
cttactcttt ggtgtttgca ccttcagtct 
ctcaggatgt ggttcttaca gctgtccaga 
tatgcctgat actgcactgg gctcatgcaa 
gtcccagaag caacctgacc ttttcttcca 
caaaagaatg ccagagaaaa atcacagcct 
cctaccctgt cacagtgtgg tgatcagagg 
tgggaaggct cgtgaggctg catctacaag 
cttgtttgaa gaatccagcg gagtgagagc 
ggcgcctcat tttataaaat agacactcaa 
ttcacccaga agttaaacag caaagctggg 
tttgtggctt cttttctcca aactgccttt 
gggtggggca ggagcagaga cagagacaga 
ctccccactg tgcccctctc aaggactctc 
ttcttcctcc cacacagcct tccctctccc 
gacactgtgt gatctccaga gctgacactg 
ggtggataga aagacaaacc ccagatgcca 
atcccagtct aggttactct ttgtattcac 
acctctgaga ttttctgcct ccctccaaga 
agaaaaatga agttcaagtt gttgaaaaac 
attcaaactt aaaacctttc ccattctaag 
gcccaaactg gggaaacctt tgcaattctt 
gaaataaact cctttcttcc ttaaactgcc 
agctgtttgt ccattgcaag tcacttctcc 
tgggaatatt aatgtggatc tcataagata 
ttagaacaca gagtgcaaca gaagcccagg 
accatttatg caaatcatat ttttctgtaa 
gttttcaaaa ggaaaaaaat cagcaaaaca 
gttttcctga ttgtcttcag ttcagaggta 
tcccttaagc ttaaggaaga ggctgttcta 
tcgatcttgc aaaccatgca cacctaggct 
agcatccttt ggaccatttc agatatgtaa 
tggaaactca ctgtcaccag cagccttcaa 
cagatggttg gtgagctatg taactagaag 
ctttttaaga cagaaaaata gagaatacaa 
ggattttagg taaatactta ctatgtgcca 



attttattaa aattattatt tctagtgtga 172380 
cccattccag gtcttatcgg ggtacttggg 172440 
gcttgcagat taactcctgt agaatgttag 172500 
tccaagcccc cagccccatg ctgctgctcc 172560 
tctttagtct ccttgctccc ctgtggtgat 172620 
ctcatccact ttcttcagtg cctctcttcc 172680 
gtttctgcag cttctggtta ggcaggtggt 172740 
agttctgggt ttgaattctg acttttatac 172800 
ctaccttttc tgagtttttg ttgccttatc 172860 
cttatagggt tttatgagaa ctaagtgaga 172920 
tctacctgtt ttatgctgcc atagccccag 172980 
gataactgat tctctattat ttcatgtctt 173040 
aaggattgtg tgtgacttat tctctgttga 173100 
ataacaggtg tgctttaaaa agtcaatgaa 173160 
cctagaaaat aatgtctgca cctacatatc 173220 
cctatgctag aagaagtaga ggtagtagga 173280 
gtagttaaag acaggcctaa gaggccttct 173340 
ctctacaacg tccccaacag aaactcagcc 173400 
atggaaaagt cctacctctg agacagcccc 173460 
gcactcttcc tggaacaggg cacgacctgt 173520 
acttctgact ctaatgccct ctggctcttg 173580 
gcctttgagt atttgaaaat ggctaccttg 173640 
ggacacaccc ctgttccccc agtgccttct 173700 
caaccaccaa acatggacca ggcctgtccg 173760 
gaaggactaa gaagcatgag gctgaacttt 173820 
gctagattag aacatgtaaa ataacattct 173880 
gtgaaaatat ctaagcctca tagatggcca 173940 
ccagggaggc cgcctcagct gtggggtgtc 174000 
gccagcagtc atgtagagtg tgccctcccc 174060 
tgaagggtac ctatgaaact catctcaccc 174120 
gaatgagaga aaggaggtga tctgtctgaa 174180 
ctaggatttg gagtattcac ttgctccttc 174240 
tcacaatgtg cccatatgcc tctgtgttga 174300 
gacagagagg gagctcagaa gtgcttccta 174360 
tttgctgatg aagcttattc cagtttcctc 174420 
gctctgctct cacctcactg ttcggtaaca 174480 
cccagagcct ttggaaggaa aaggagagtg 174540 
aaggatgctg tcttccaagg tgatccaagc 174 600 
ccccatcctc tgagagaatt caccccagag 174 660 
aaagtaagat tctggtccca ttttataagg 174720 
cttttgtaaa gtcatttgac taaaactgga 174780 
gattaggctt gttctccatt gggccacact 174840 
aggactggta ttataattga gcaaagccaa 174900 
tgggtttgaa atctggctca gctacttact 174960 
tctttgtacc tcagctgtct cacctatgag 175020 
aagataaaat gagttaatgt atgtaaagtt 175080 
gtctcatcct aactctgtca cttactcatg 175140 
gatctcaatt tgttctagaa ttggcataat 175200 
atcttcctgc ctgtgtgatt tagggttcta 175260 
tcaaaaaata tctctcatag ccacactggc 175320 
ttaatctcaa gcacttggtg tgtgtatctt 175380 
gatttggaat ggcctttttc atgctgggtc 175440 
gacagtgact atgattgctc tgacagagca 175500 
gcaagagggg ccccttagca ctaattttgg 175560 
agtctggtcc ttgtttcctt acctatccct 175620 
tgggagggaa atggagctcc aggagacaca 175680 
agcactgttc tctaagaatc tgacaggtaa 175740 
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catagttaat cctcacaaca gctatgagaa ataccataat aatccttatg ttgcagcaaa 175800 
ggttactgag gcactaaaag gttgagtagc cctcctgagg ttgcacagct aggaggtggt 175860 
ggagctaaga tttgaatcta ggctgacatg cacaggcagt ttctgtgaca gagggcagta 175920 
accctctgtg gctctgactc tgaacctcaa tacatgcata ttttctgctt ggtgttccca 175980 
gccctcttca cccaaccaaa ggcacttttt ttttttcagg cttcatgctt ttgctgcaag 176040 
accttccctg agctccaaga tcaagtttgg gtccactcat tttgctctat aattacttgt 176100 
gtaatattgg cctttctcac tagaccttag gtaacttgaa agcaaggacc acacctgtct 17 6160 
tcttctctgc tacatttcta tgtatagcat gggcctgaca gtgttaagca ctcatgagtg 176220 
aatgagtaaa tggacaaata aatgatttac tacaatattt cttacctggt tcccagtagc 176280 
tttcatattt atcatgtttg gactttcatt cttcttttta tcaaacattc aatgggcacc 176340 
tactatatgg gaggcctgtg aaggtgtaaa ccggcttttc atcaaaagat gggcatgcta 176400 
tgagtatttg cttggccttt tctagatagg aaggttttga tcatgtaaag ggtagatgca 1764 60 
attggtttta agtccactgg gaactgtagc cataagggcc cacctcatac caggaaattt 176520 
gctcttgctc tgaaatggcc ttggccaggt gtcattcctg accacaacag ttggggcaga 17 6580 
gtaaagatga agggactctc tggaagacca actattccca ggccaggtag tacacccatg 176640 
cagccttgca ctcaatagcc caacctggag tccctgacct tctgccagaa agtgccagca 176700 
ttttatccta aatcccaggg ctgagagaaa ttagtttgtg agaccctgac tgacatttac 176760 
aatgctttgt caatactgga tacttgtatg aggtatgtct gttggtgtgt ttgcagatat 176820 
ccaagtgtgc ctccctggtt ccaaatgtaa ctggataagt caacacgaat gcttttttct 176880 
tctcaatatt tatggtgttt tcactgattt aagcacagct acaatacctt gtttaattag 176940 
aggaaaccag actaccctct gtgttaccag agattatata gggagaggac accttaaagg 177000 
tttggttggt taaagtggct aaaataggaa aatgtgaaga cttttatgca gagacatggg 177060 
tatttgagga cacaattaag tcttcagccc aggagtattc cacaaattca cacatatcca 177120 
tattcatgcg cacatacctg tgtgatcata gaattgtaga tcaaagctgg aagagccctt 177180 
aagagagaaa tcaagtggct cagaagtttc caaacttttt agtttgaggc tctattttca 177240 
aatggaatct tgggggagaa ctctcatata taaaacagat aaaaagcaga gctactatgg 177300 
tggaaaagag agggtgggat ttaggaccac atctttctgc cttttcccta acctaatatg 177360 
gtaacttcta aaatgcctgc aaccaggaac atagcttata agatcattga tgtagccaaa 177420 
gttcttattt tgcagatcaa tcaactaatg gctatacagg ttgagataat tgcccaagga 177480 
caaacaactc atgagattca aagacaggat ttatgtggtt ctccctaaaa ctcttttcct 177540 
gacacttctt atttatgacc tggttcacaa tagactacac aaaaaagtca ataagaagac 177600 
ctctttgatc cacaagacaa aagctgaggc tgagtggtga caggattgaa gcttataaaa 177660 
aaatcatgag agggtgagaa gaaactttct gttccttcat tcctccacat gtattgagca 177720 
tgcaccctga tccaggtact gcttgtggac gctgtggggg ccaacgtccc tgccctccag 177780 
gagctcacag tctcacaggg aagatgagaa gggcccctgc acgttcaaag agaaagctag 177840 
gagaggccta gagagatagg aataaagtgc tgtgtggttc acatttcaag taccagcaca 177900 
cattaaagga gctaagggaa atgccctgat ctcatggaag cctgatactt ccttcttggt 177960 
ataaaaatgt gtgctgctct gagtgcttca ttgcaggaca ggacataaat gaatagattg 178020 
gcctcactag tggttgggag gcttgatgta cagtggttag aaaacaagtc ctagagtctt 178080 
tttacttgtt ggtctctgtc cctcactagc tatactgtac tgagcaggag ctactccatt 178140 
aacttaagcc tcaatttttt aaatttctaa tttgctaatg ggaatgataa tagtacccac 178200 
tttacaggat tgctatgagg attaaatgag ataacacaaa taaagtactt agcacagttt 178260 
ctactacaca gtacaggttt gatacgtgtt aggtacttgt aattattgtt ttagccaaag 178320 
tctgccttct atgacatttg ccctttcatt ggctctggcc tgcgttccag agcctcagag 178380 
aacaaatcat ccatctcttc cccatgacaa cccttctggg actcaaatac atccttcact 178440 
tactttcttg gtcttctata cttttttagg ctgataattc ccacttttaa aaaaatactt 178500 
ctaatataag acttttcaaa ccacagtttc ctataactac tctctcaatg gagctgtcat 178560 
ctaagatttt atctgacaat ttggggaaac ttggcatgtt caacctgtgc cattcagagt 178620 
atacagtaca agttctttga tttttctctc aacttacctc ctgtggatct caaggggatg 178680 
ggatggagga agtgctcttt ccagtgtccc aggatttgtt taataaacct tacttgcccg 178740 
tttccttgaa caatgttaga tgctccttct tccttctgac tggtgtggtg tcagcacagg 178800 
caggatgatg agggctgaac agctcgtatg tgaccccttg aaagaatccc agagcaaagg 178860 
aaatccagct ttgaagataa agcctttcat ccttcgatag ctatgttcct atacacctgc 178920 
ttcatgttct ctcaggcttc ttgcctgacc cagtccctgt atttgaagca gttctaacac 178980 
ctaacctctt ttgacccaat gataatgata atggtgaatg tgatgatcat aataatagct 179040 
aacattcatt aagcacttac tatgtgccag acatggtact aagtgcttag ttttattaat 179100 
tcttttaaac cgttcaccaa tcctgtgatg taggtgctat atctgcattt tgaagaccag 179160 
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agaactgagc ctcagagagg ttaagtaacc 
ccaattgcac gtgataatta atagtcataa 
aagcattctc caaatcttca cagcaagatg 
acacctcttt aggatatcat aattacttga 
tataattggc atatgataga tggagtgact 
ttactcccca tcattgtact caaatgcctt 
atacagagga gacagattct gttctcaact 
cactattccc ctctttgggc ctcagtttcc 
cctcctacag tggatataga gaggagacag 
catataacct taagccacta ttcccatctc 
gaagggtttg gtcttctctc caaaatcttt 
atggtccctg tgtcaaatcc agccatcacc 
attttttaca ttttttaatt gttgggaaaa 
ggaactatat gacattaaat taccatgccc 
ttactatatg ctaaatagta atagtaaatt 
tgcaatggca gagctgacta gttgtgacaa 
ttataatctg gccctttaca taaaaagttt 
ggacactaat aggcttcaga taaactagac 
gttagtgtca aggagaggaa cctagcgcag 
tacatttgaa ggcccctgct ctggcagggt 
acagtctgag gttcattgtc cctctttgtc 
taaagtcata tgtttctcca ctggtggagc 
ttatagttgg accacagttt gaattcttgc 
ggtctcttaa ccatagctcc ttaacagaaa 
gcaaataagt attgggtgtc tactgtgcac 
aaggattatg tatataaagc atttaacaga 
tgtttaaaat acatctcaaa caattattat 
aactcttaaa aagctgtttg aataaagtta 
attattgtca tttttgttct tataagcaat 
aacagcacca gtgtgcttta atttcaggcc 
aaaagaccat aacaaagctt caacaaatgc 
tactctgtcc cccactttcc agtgactaat 
tgaggatcga tttttatttt ttaatagtat 
agatattaca gaagggtata aagtaaaaga 
catagaagta actattttaa acccttttcc 
acttctacat catatattta tgtttctttt 
gacttcccat tagaaagatg acaaatcagc 
atgtttgtta gcctatatgc tattatttgg 
acatagtgaa ccttgatttt tgattcacca 
cagtttcctc tttagggcta ctgaatgata 
cctcaaggta ttctgatacg tccctcagaa 
caatttcttt gcatttaata atgacagtag 
tcacacatgg ggcaatgtga aatgctacat 
gacctaatag ccctaggaac tcactcatca 
ctcaaagaga agcatctggt accagtgctg 
cctgcattct tgtgctgaat agctggaaaa 
tgtagattaa tttcattttt tttttgtcag 
catttgggaa catgttgtaa ataggcaaag 
tagttaaaat ttttaataga atataattag 
attaatagca caaataaagc acataaagaa 
aaggtttttt ttaataaaat ggcaaagaaa 
aatgagagtt cccaggggaa cagaatgagt 
aagagttaac acctttattt cccccttctt 
tcttgaatta tggagaaatt tagtctagta 
aagcattgta gagatgcaag attattattc 
tgtcagctat tagatgagaa tatagttcat 
tttacttaac atcatagcta tgagcttgtt 



agtagacaca tataatcagt tcttttcttt 179220 
tgatggccca gagcctcttc aagattctta 179280 
aaagcatgtc caatgtagtt gtagtgttaa 179340 
aatgcctttt cattctaatc ctttgaaaat 179400 
atattaacta gtttacttaa ttgagcgcat 179460 
ttcaacattg tgttacctcc tacagtggat 179520 
aatcctctca ccagccatat aaccttaagc 179580 
ctatctgaac aatatgaggg gtttgtttta 179640 
attctattct caaccaatcc tctaactggc 179700 
tgggcctcag tttccctatc tgtacaatat 179760 
gctcctgtat aggtcaaaag tcagcaaact 179820 
tgattttata tggctcatga gctaagaatg 179880 
aaatcaaaag aatgatattt tataatatgt 179940 
aaaaatattg ggacagccat gctcttccaa 180000 
actattttct atatatggct actttcatgc 180060 
gaaccatatg gtttgcaaag cctaaaatat 180120 
gctgacccct tgcatagatg tggaataggg 180180 
actttggaag catctaaaga cccttaggag 180240 
tcccacaaat attcgttcaa gaaagacagc 180300 
gacatattct ggctattaac tgtgccagcc 180360 
tggagcaatc ttgacacacc atcctaaagc 180420 
agtagaacac atttgtgaag aagacagatt 180480 
ttgctgtgtg atctcaggga aatttccaaa 180540 
attaggatgg atactcactc actcatttat 180600 
atttacctct tggaattgtt taagtaatta 180660 
atgcctgact caggaataga gcttggaaaa 180720 
tcttttgttt ttaaattaca gtaacctcaa 180780 
tataatgcat acaaaggggc atttttagcc 180840 
gcttctgaag ccgttttcag tggagcactt 180900 
aggagataaa aaattaagca gggctctctt 180960 
aacaacttct ctcccctgtc ttgccactgc 181020 
ggatgatatc taagatacat tagagctatc 181080 
taaaaataca atatattcac atgataaaac 181140 
taaatgccca ttctttccaa ttctatactt 181200 
tgcatttggt tatttaatgg tttcctttac 181260 
tcttgcttta tcaactttaa ctccatatct 181320 
tcatattaca cccccaacat ccccttctca 181380 
aattcttctg atgattacct tgtatttttt 181440 
gcattagagc tttgtcaaaa tacagatatc 181500 
atccttgggg atggttattt tgcaaaagtg 181560 
ccactttctg agagcttaca gtttattggt 181620 
acacagcatt ctttctcacc gatttattag 181680 
aggtgcagaa gaagcagaca agaacacagt 181740 
ccaagcttca ctttaggtga gaaacagtca 181800 
ggaggtcagt ggaaggagag tgggtttaga 181860 
acacttgact aacctgaaaa acaagtactc 181920 
tagtaacagt atatagagca atggtcccag 181980 
tggctcaaag agagtagtaa tcaaacttcg 182040 
caatgtagta tttttatata atactatata 182100 
aaatgttcta attaccaaat attaccctct 182160 
attagaaaat tatttaatta aaccttattt 182220 
gaattgaatt ttcccataaa ccagtgggca 182280 
agtcaagaac acagttgtgc aggttcattt 182340 
actataatag agtcatttct tagtgattaa 182400 
ccagcatgaa atctgaaata ggagggaaga 1824 60 
gaccacaagt tatttcctaa ttttgtttct 182520 
ttatcattct ctgtccattt tttcttagga 182580 
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ggggatgggg gttcagttgg ggagtcttat tagtttgatc ataagtaatt aaggtttggc 182640 
tgcactaacc ttgttgtcag acattctggt gacaagattg agtggcattt atgttgagca 182700 
gttgaggaaa ctcggccatt tcctttggcc tccattatgt ctgagatgga agcagtcagt 182760 
atgaaatgcc caggaccttg ttacagattt tcaagagaca gaatttgact gcaggagata 182820 
ctttccattt caatctccca gtagttactt tatgaacatg ttctttttaa agtctctaag 182880 
aactatgaga aatctttcaa ctgtgctcta gcttttaacc agtgctgggg taagtgctct 182940 
cttagatggt ataacgtctt ggaatgcata ttgggtacag gaacctagtt ctgatttcac 183000 
ctttactaat ggcttgctag aaccattgtg caaatcagct acatctgtga aactgtaaac 183060 
tgtggctgtg gaacacgtta aactcaaaag gtattttcaa cctgaatttc tcttttcaac 183120 
caattacaat cacccactgg tatttgctgc ctgctggtca tatttctcct ggataattgg 183180 
ctttctttct tgaggaaagt ctctggcttt tatttgttca ctcaacaaat atatactgtg 183240 
cacctactct gtgccaagca ttgctctagg agcaggttct aggacacatc agtgaccaag 183300 
acaaaaaagg ttcctgctct taggaaactg acatttcaat aggaggaaag aggctatggg 183360 
aacacatgag taaacaagat aatctcaaat ggtgataaaa actctggaga agctctaaca 183420 
gggcactgag atagagcaga ctggataggg tgctgggggt aactctacta tgttcttcca 183480 
gaagcagttc cctgaaacaa ggattcaagt acaaggggct tcttgggtgc caatcctaaa 183540 
aaaccaggga agggaggcag agaagggaag tctgcctaca aagtttcata aacaagatgc 183600 
ctacactgca ggctactgaa ggagaacagt ccctgagggg atcccaacat gtcagtgtag 183660 
aactcacctc agggtcatgc catgcagggg agaggattct ggggcactta tctactgact 183720 
cttcatctgt cattgattga aagctgcttc tgggcatatt aactccttaa catgtctaac 183780 
ttagcctgca ggtaggcgga gtagccaggt gtttgcagtt ggtacgtaaa gtagaatacc 183840 
catgtagaaa ggggagtgga agcgaacagg atgctgagat tgtctacaac agggcaagaa 183900 
ctagttcttt agataagaag tcaggaaaca cctctctgag agatggcagt ctagctgaga 183960 
tccaaaaaat ggaaaggagc cagacaggga aatcacattt ccaggcagag agaatatcaa 184020 
ggacaaaagc tctgagtgtg actaagcttg ctttgtttat aaaaatagaa acaataaaaa 184080 
atcgcagaac tttagaggca taggggtggg ggagggttgt ttttgaatca taaagtctat 184140 
tcttttagat agaaaaatca gagaaagcca atgagaaatt cagaaacata gacggtatcc 184200 
taggatccca agagggagct gggaccgctg ctctaattcc ttggctccca gttcagaact 1842 60 
tctccatcga ccatggatcc cctgctcatt ttcccatttc attcattgga atttggtggt 184320 
ttcccccaag ccatggagct gaaagggaat tgcctgtatg atcccactca aattccttgg 184380 
gcagttttcc aagatacgat caatggcata tgagtggaag atctgcatgt gacttctgaa 184440 
aaaagttctt catgaagaag ttgtgcattc catttttttt cttcctgctg ggcaggcccc 184500 
caattatacc taaaagaaac ccaagttaac acaaggctct atgtgaccag ctttgggctg 184560 
ccaatttgac ctcctttccc accctttctc ttcaaactca ctgtgccaca atcacaccag 184620 
cctccttact gtttcttgaa tatgccaatc atcctctcac ctctgggcct ttgctcagaa 184 680 
tgctcttctc caaatatccc catagctcag acacaggcat cacttcactc aggtccctaa 184740 
gccaagacca cctccttgcg gaggctcatc ctcatcactc cttccaaagc atgacactgc 184800 
ccctccccat cctcttacct cactttatcc ttcatggggc tcattgctac cgcagagtat 184860 
actatcccag attatactat actatgctgc atatttactt gttcatttat tgtctgtctt 184 920 
ccccactaga atataagctc cataagggaa gaaacttgct ttattcacta ttagaatagt 184980 
gcactatgac ttttcaaaaa tgtttgttga aatgattaat cagtgaataa gtgagtgagt 185040 
gactgagtga accaatgcct aacagaggcc cttagttctg gtgctctctc tgttactaac 185100 
tccctaaatg acgcctctgc aagttgattc ttctactctg cctcagtttc tttatctggc 185160 
aaatagacat ataatgttta ccttaaaaaa ggataaactg aaatgaggcc atagatatga 185220 
aagtactgca cattattgaa tatattatac aaatgtaaga tttcctttta cagtcgaagc 185280 
cttcttgagt tttagtaaac ttcaccaaat ggggtggaaa gtgtcattat aataaacaat 185340 
ccttcatggt tgctggaatt gatcaggcct catggagtgg tgaaaattta ccctgttgta 185400 
tggaaaagag cacagggtga ggatggagac tgggatcaga actcggcctc ttccacagac 1854 60 
ttgctggtgt ccttgggtaa ctcataggag cttgcccagt gaaattcaga ttcctcttct 185520 
ctcaaaatgg ttgggtattg ggatggtaga cgagattatt ccctccttag aggtttctac 185580 
cttgtactat atatctgcat catcataata atcacagcca ctatttattg atcacttggt 185640 
ctgccccatg cacttgacat gtgtcattgc attgtatgtt taccccaaca accagacctc 185700 
cattattcag acaagaaaac tgtcaagtaa cttagttaat agtggtataa ctactggctt 185760 
atgcatagca gcgccagcac tccaaactag gcatactggc tcctgagctg tgctaatttt 185820 
gtgcacaaac taagtaataa tcaagttcaa aggaaaggag gcctaactgt ctctccattg 185880 
agtaggtttc atctggtgga actgaatgga aaatcctgtg gttttggaac aggttaagtt 185940 
gatttatcca ggcagcatgt tctaacttct ccctcctccc cctctcacag ggcttcatgg 186000 
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acattgactt aaacaaattc aaggagagtg 
actacacaga cactattccg gccaagatca 
accacacacg ggtggactgg aagagaccca 
ggccagcctg gtccctttgc ctgccccaga 
atcagttttc taaagcaaac aaattcaggg 
ggtgcagggg tcagacaacc caaaatgaca 
ctctcctgtg aacaaaactt aggagtcagc 
tatcaaatac ccaggcttct tgtaccttct 
cctaatagta caagagggct gcttaaacta 
gaagaaaaaa aaacatgcaa tgcacaactt 
agcagatctt ccattcatat gctattgatc 
caagggaagc tggaaaatgg tggttctatt 
ggaggatgct catttcattg tagattgagc 
gctcagccat gtgcccagct aaaaatcagg 
tctacctggg agagtccact ctcagtaatc 
aacacagctc taaatactgc caccaaaatc 
aataaggata gcgaatgatt ctttttctga 
atggaaccct gtgttgagaa ggatactaag 
atctttgaca gactttctct tgccactcaa 
taacggagat acttcttatc ccactattca 
tatttcaata tgatcttaag caaacaaaat 
tacataggaa ttaattaagt aatgtgcaat 
agatgacaaa aataaccaaa tgtaatttaa 
ctctgttgcc caggctggag tgcagtggca 
cgagttcacg ccattctcct gcctcagcct 
accacgtctg gctaattttt tgtatttttt 
ggatggtctt gatctcctga ccttgtgatc 
ttacaggcgt gagccactgc gcccggcatg 
aattaaattt aaaaaattag aaaaataccc 
acttggagtg tgctaggcaa cataataagt 
ataggttcca aagccaggat atattttcct 
atcctagtcc cctttttgtc aggagagggt 
ggggaaagtt tacaggatat aaacttacca 
gagctcaccc ttctttcatt ttctttttgc 
tcgttcctct tggtttccag ctgatgactg 
tgaactgtat ccagcctgca gatgtgttat 
ccaacatttt aaaaattgca agattgcaca 
aaattataac tggcagtact gaactcacag 
caagcagcag cacccatctg cttgaatatt 
accactgtct gccccacgta taccatgcat 
gaaaacacat tcccaaatac agcacccaaa 
taactgtctc cattcctccc actagtacac 
gatggctgag gctttccaga gcctgcggag 
tgctggggat tgtctggcta acccagctgt 
agctctgcag caggtaagac caccaatgtt 
tttcagcatc aaattccaat aaaacacagc 
agttcagaac aaccactgca ttgttggtgt 
ataaaaaaaa aatgatatgg aacagtcatt 
tagcaatttt gctgaaaaag gagagctggt 
atacacaagg atgaaactga accatgttaa 
tgattttctg gatttcttcc tttgctctct 
ggaccctaaa aatgtttctc agtacttctg 
tcatcaatta tgtttgcact taagaatagt 
aactccaatc aattggctat acaaaagctt 
agcataaata ttcaattagt gatatgtctg 
gtgggtgtat atttaacatg cctggtttag 
tgctcttggg gctcctgtag atctctaaag 



gcgccaatgt gacaggtttc cagctggtga 186060 
tgcagcagtg gaagaatagt gatgctcgag 186120 
aggtgagtgg atgggcagcc agcagcaaag 18 6180 
tttctgagct gcattagtct ttacaaagga 186240 
aaatatattt gtaaagatta ttaggtacaa 186300 
tggttttaag gagggagaca ttttatttat 18 6360 
agtcccagac ttgtaaaatg gctccacagt 186420 
tgctctgctg tcatccacat acggctgctg 18 6480 
cagccattgc ctcagcattc tacccagtag 186540 
cttcatgaag aactgttttc agaagtcaca 186600 
gtatcttgga aaactacccc cagttggctg 186660 
ctgggcagag ccaccatgtt gcccagctca 186720 
ttctttgtgt agtgaggtgg cccttattct 186780 
gcaacctaac taaggaatat ccagcagatc 186840 
aggcagactc tctgcttctc acttcccagc 186900 
tattttgttt gttatttgag aatgcttggg 186960 
agtcaaacca gatagcttga cagtggccat 187020 
gccaagtttg tttatctgga caaacagtga 187080 
tccatttctt tttataaatg gcaaaccgta 187140 
aactgattta actcttcaat aatgaagact 187200 
cgcaaatatg ctttcaagca atgagaaaac 187260 
gattgcactc caatttccaa aagactgagc 187320 
tttttttttt ttttttgaga tggagtctgg 187380 
tgatctcggc tcactgcaag ctccacctcc 187440 
cccaagtagc tgggactaca ggcacccgcc 187500 
agtagagatg gggtttcacc gtgttagcca 187560 
cacccatctc agcctcccaa agtgctggga 187620 
taatttaact ttttaaaaaa gaagtgagag 187680 
attcattcgt tcttcaattc tctattgtga 187740 
attaggaaca aattagtgaa gaaaacagac 187800 
ggactgctct cctgattttg tttcatccac 187860 
ataattgata gttgttgtga gaaagaatga 187920 
cttaggtaag ccatatgtgg aaaaaattag 187980 
ttttactttt acaagagatg aaagagttaa 188040 
taatgccggg ataaaccagc tgatttttca 188100 
atgtaatcta tgcatttttt cccaaatgaa 188160 
cagaaaccta gaatcctggc ttctcttcaa 188220 
ttcaaggagg tcagcattga ctagcatggt 188280 
tatgcttctc agggacacca tccctgattc 188340 
ctggcccact gcacaccctt ctcatcacca 188400 
cctgccttcc tgtcagctct ctttgatacc 188460 
ctctgcgctc acctacgatg gggtgaaggt 188520 
gcagagaatt gatatatctc gccgggggaa 188580 
tccctggggc caagggatcg acatccagag 188640 
tgccccatct cataggagcc tactggggga 188700 
tattctaaag aaaaggaaga aaatgcctga 188760 
tggtggttct tagcatacat tgaagctcat 188820 
tcttgtagtg taaaagctgt attttctcct 188880 
gattttctgt gtatgggaaa gggacttgaa 188940 
tggtctcctg gaaacactcc cctcttgccc 189000 
gcacacattg tatgagagcc tggggtatgt 189060 
atactacctt cttgcttctc agcatttcat 189120 
taaagggcag ggatagaaaa tggccacgcc 189180 
ctgaggctga atccagatac aacagagaga 189240 
ctgtggaagc agcatgagac agtggtacat 189300 
aagattttgg ggacttgccc tcagctttat 189360 
cacctcaagc ccaggctatg attctcaaac 189420 
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tgccaaggaa tagcactata tttttgttct gatgcctcct gctagagaag ctgtatttcc 189480 
gaagggacag ggagaggatg gccgtgcttg ctgcaaagga ataacaaagc attggccaca 189540 
atgacaaacc agcacagcaa gattatttgg cacaaatgtc cctcaggacc caccctcccc 189600 
aacaaatgga gttgcctgcc taagaccagc tttactttta agacgacatt cagcagaaga 189660 
tgaatttggg ctccttgggg agaagaggag catttctgaa tgagagagaa atgtacagca 18 9720 
aaatgtacaa ctattcccac ttgtttgcca accacacgtg cacagctcta gcaactttca 189780 
caaccaggag aaagagatgc tgaaggaacc tcaccagctg ttcctcagcc ctgatgagtg 18 9840 
cctttctgta gagaagaata tgtaggtctg tcctttgaaa tgaagctaag caggatttct 189900 
actagctaca tgaggaacat gctaaggagg acctccagct gccatttctg ccctcccctg 189960 
gtccctgtgc agggtggctc ccaagtcact aggcagcaac aacagacacc aggcagcttg 190020 
caggggaaaa ttgccaaatg gacagtgcct gcgcctccct ttctaccccc accccagctt 190080 
atttaaaatg ctcttccaga ctccacagtc gagacagatt ctgtttgcaa tgtgaggctg 190140 
gtttcaaaag gtcccagaga tttgtcctca gcttcacaac ccttccaact cccttgcctc 190200 
tcccatctta atgacgtcct gaaaggatag tctgatgcca ccatgacagt gatgaagcct 190260 
gggaggcagt tcagggtggc aggaaggact ctgtactggg aggcaggaga gctggactcc 190320 
acgtccagct gggcccctgg ctctctggat gaccttgaaa cattcactta gctctcctgc 190380 
tgccttccct ttccttcccc atgaaaatga agggggtgga gagaggacct caagtctcaa 190440 
ctggctctga aaattcaaac agtctaattt tttttcttat ggtctaaccc atgtccttct 190500 
ggttgtagtg tatacttagc acactgcttc tgccctcagt agggtttgaa aggaagcagt 190560 
ttccatagtc aaaagaacaa tgacctcttt tattttgtct tggagccaag atatgtgtgt 190620 
gcctgccctt catcagctgt tctttgagga tctatggtgt gccctggctt ggagccaagc 190680 
actgaggaca ataaaagaaa gcaaggcctg cgctcaagga actcctggca gtgtctatgg 190740 
agaaaaagag aggaacaact atcccaaata aataaataaa taaacaaaca aacaaacaaa 190800 
ccagaacatt ctgaccttaa gatcagaggt tcaaatcgat tcattttcca atctttgtta 190860 
agcacctaga atgaaaaaga ttttgttgaa agacacctta gagattactg actcttcttt 190920 
tccctcactt tataatgaaa cagcaaatcc agagtgggca gtgactcacc cagggtcaca 190980 
cagcagctta gtggcggagg ctggaccaga tcccagtcca gggctcctct tcccccagtt 19104 0 
aggtctgtgc tatgggccca ggggcccttg cttataagca ttgacctctg gagccttcca 191100 
tccatagtca ggactactct tcagcaatgg ctttttctag acccatctct tgaaatagcc 191160 
atgccaccag ggagctgggt aaatcctgat ggtatccagt cttagaaacc tgaggcttgg 191220 
aggctcaggt ctctactgcc ttcacctctg gccttcaggt cacatagtgg gagagaacct 191280 
gtggtttagc ttgtgttaga cctgggttgt ctataagtta ggccaagagg tccagcacag 191340 
cctacctgga gggcaagcat gctagggagt taaattctga aaataatatg tatggtgtgt 191400 
taaaggctta gtatacacaa agctctaaac atttacacat attatatcat tggatctttg 191460 
atataacacc ttgaggtagg tagttttatt aaccgtggtt tgtaaatgga gaagccgagc 191520 
ttcagacagg ttataaaact tcccaaagtt atacagctag ctagtggtaa agttgggact 191580 
ccaaaccaca tttagatgat gcctccctct gtatgcttca ctgctctgaa acctcaaggg 191640 
cacggccaaa cagggaagac acaagagcta tggcccagcc cccagcagca catggccttg 191700 
ggtaatatct gtgcaccccc tcccccttca ctccgctgta atttccctag cagcagaatg 191760 
aagataatca gctgcctgcc tgccccctgc ttgccatcta gggagtcatt cagaatatta 191820 
atgaacccac atcctggagg ccctcagcac tccaggaagg atgctcatat gtcgccttct 191880 
ggacggtgga gaagactcaa ctccccaagt cgggttctta acatagggga ggtaataggc 191940 
atttgagaat atgatttttt aaaagaaatc tgtagattct tttgcttaga aaaatgcata 192000 
tgtgtacata cataatttct gcatgaccag gagtagagag aacatacata ctcctagaat 192060 
ccatcccagg gcctcaaatt tagaatacct cccctaaata gtaatcatcc ctcttggtct 192120 
ttgtttttgc aacctgccag ccaacaatca agcatatgga agtgaaggta tgctcagaac 192180 
tttgctagat ggggtctggg gttgggtggg caatggagtt taagacgcag tcctgccttt 192240 
aagggctaac acaggtgggc ttatggtgct gtggatgatc agagcagaat gccatcactg 192300 
aggaatgcag acatcagcaa ggctttctgt aggaatgtga ctggcctttc aataggatga 192360 
ccactttatt atccaaaatc tatcatccaa acgggaacac ttaggagtga aaggaaaaac 192420 
tactatcaat tattccagaa caagagtacc agaactctcc tgcgttagtc catgtgcatt 192480 
gttataaaga aatacctgaa actggataat gtataaagaa aaaggtttca ttttagctaa 192540 
tagttttgta ggctgcacac gaagtacact gccagcatct atttttggtg agggcctcag 192600 
gaagcttcca atcatggcag agtgtaaagg aagagacagt atgtcacgta gtaagagaag 192660 
gagcgtgata gataagggag gaagtgacag gctcttttac acaaccagct ctcatgagaa 192720 
ctgagtaaga acttattcat taccatgggg agggcgccaa gccattcagg aaagttccgc 192780 
ccccatgatg caaaaacctc ccaccaggca ccacctccaa cgtccagggt catattttaa 192840 
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catgagactt ggaagggaca aatatccaaa 
gtggtaatgg aaccgttaaa tgtatggtaa 
ggttctggat ggggatgtca atgataccca 
aaaggacttg cacatgtcac attggcctgt 
caaccgtggc tccacgcaga actatctggg 
gatctggatt tttcctgtct gctaaacaga 
ggcttgtcac agggctcaga attttaagaa 
ttcatacctg taatcccagc actttgggag 
tttgagacca acctggccaa catagtgaaa 
gcgggatgtg tagcaggcac ctgtaattcc 
acttgaacgc aggaggcaga ggttacagtg 
agcctgggtg acagtgtgag actatgtcaa 
ctttgaactt cagggaagct gcatgagaca 
caggtaggaa agtgaagctt aggagggttc 
tgtgacccag caaggaccta gtctacaggc 
atccacctga gggcaactgc aagggggata 
gctgtctcct tcaggactgc ctttcaggat 
tgacacctga caaagattct ttgcttgacc 
taggcccatc tgtgcaattc ttgtgaaatc 
ttagcaagaa ccctgtccac cacgtccacc 
caccatcccc cagattatgt cggatcatcc 
ccattttagc cagaattcct cttaaccccg 
gacccccacc ctgctccttg gctatatatt 
gcccaatctc tctccacctc tgcaagaccc 
attctgaata aagtcttctt cactgtgctt 
acacagctga agtaatagag acattaaccc 
atatatgctt cagcctccca tcagcagcag 
ggaagcccaa ggttaattct ccatctttca 
cagaagttcc gggagcagga accaagaaga 
acaccctggg atggggggag agagggataa 
gatggggaca gtggtttcaa gatacacatt 
cctgactctc cttgataaat ggtttggggc 
gccatcatgg aatggtgttt ctgggatgtc 
ctcagtgcaa gtaagacttg gcttccccaa 
acctctcaat gacttactgt gaatcctgga 
atctcatctg tgaaacaggc aaaataacgt 
acatgaaata agcagcttct tgtgagatca 
tagaactgtg caggtaggac ccttgacctt 
cagtgacatg atccttcttt cttctgaaag 
ccaaggcctc aattttagat tttatttttc 
agtagtagtc gtcattatca tcattattat 
acttatgtcc cagacaatgt gctaagtact 
tttgagcaag gaacctatgg tcatacagtt 
ccagcagcaa tagcagaaag caggtagaaa 
cctgaagcct agcacaatag cactggcctg 
aagataccag aagttaccag aagctccaga 
ataagacttg agcagatcac tttacttccc 
tcataatctc taaatggcat acttcagggg 
gtgcaagtgt gtcataaaca tgaagttttt 
ttaacaacaa taataataac agctggcatt 
atcaaaaaat atctctttta atccttagaa 
ttccattaaa caaatgagaa acttagtcat 
tgccatgctg attattagta tgaaaattga 
gataattaat gacactttca gtttgggcaa 
tatccaagtc tgtttctttg gaatgagtca 
cttgaactgg aaataaggga acactaaaga 
gtgaagctgc tgatggactg gaattgctcc 



ctatatcacg tcccaggcaa actgagatgg 192900 
atttgaacat ggagtaaatt gggttttgga 192960 
gcaagagacg aggtagggtg agggtacaag 193020 
taaggaatta gggatagaag agctgggttc 193080 
ggatcttgga ctaattgctt gccttttcca 193140 
cataatcctc ctagtgctgc ctccctgact 193200 
agcatcgtgc ttgtgggcca agcatcgtgg 193260 
gctgaggtgg gcagatcacg aggtaaggag 193320 
ccctgtctct actaaaaata caaaaagtta 193380 
agctacctgg gaggctgaga caggagaatc 193440 
agccgaggtt gtgccactgc ataccacacc 193500 
aaaaaaaaaa aaagcatcat gcatgtgttt 193560 
gtcagagcca agattattct tcccatgact 193620 
tgagatttgc tctggccaga gtacctagaa 193680 
ctcacagcca gtgagcatcc aggaggaagg 193740 
caggagccct gcgtccagtt tgactgctcg 193800 
ccaatgaggg caggggagag cagtcactgt 193860 
aaactttagt caggcttctg aaccttctcc 193920 
cagttttggc aaagaacttg ctaagtcggt 193980 
ctctttgcca tgatcatctt cttcagcccc 194040 
tcgtccatct tcagcaagaa tcctcttagg 194100 
atgcttgctg ttagcaattt cctatccact 194160 
cccacgggcc catgctctat tcagagttga 194220 
attgcagggg tctctatacc tattgctacg 194280 
taacaagtgt gctgattaat tttttcttta 194340 
gcattctctc agcaccaagc aggaaaggaa 194400 
actctgagct ggcacccagt accctttaca 194460 
tggagggagt tgagggcgcc aggatggcga 194520 
agtgcctggg ctccatcaat ctgacctcag 194580 
gtgcccagat cccagagctc agacctgatg 194640 
tctgagaagc ttcttcctct gtctctgagt 194700 
tgagccattc tcttgaactc tccttgctct 194760 
tctttcgcat tctctgatac tttgttttat 194820 
ctagatacta agagccttca gttacccttg 194880 
gtcaagcgcc atcagctctg agtctcaata 194940 
ctgctctggc tactcactgg gtttctgtga 195000 
ccgtgacagt gca'aggggtc cttgggtcac 195060 
attggaaact gaggaaggca gtaaaagtgt 195120 
agtccccttg tcttagacaa cctaccagcc 195180 
ataactttta cccactctac acagcacagt 195240 
catctaaata atataaacta tgcattgaaa 195300 
tcttggagga ggaaactaag aagcagatag 195360 
agaatgagca gagcacaacc tgggaaatgc 195420 
acactacatt gcacattttt gtgttgtcct 195480 
tgtttcggcg aacacttggt accttaagtg 195540 
ttttagacct gcctctgcca ctggttagtc 195600 
agaatttggc atctcccatg taaataaaga 195660 
gttcctgtga ctagagatga aataatatgt 195720 
agaaagtaat attagtatta agtatcacca 195780 
ttttgaatac tcatcatgca ccaagcattg 195840 
caaccttaag acatagttaa ctctaattgg 195900 
agagagatta tgtactctgc catactaata 195960 
gactggggct gaaaagtaga tgcctgtttg 196020 
atcagtctac cagtctacca tctgcacatc 196080 
cttacatggt acccaccaaa ctaacactat 196140 
ttggcaagag attactgagg tgataagaag 196200 
tttcttttat tctttgtagt agaatctgta 196260 
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ctttgttttc tctctgaaaa gcaaagatag 
ctgatttcat ctaagaaagt aaagagcatg 
ttgtgacctc ctcactgaga cttcaaacca 
ggaacacaca cacatgcaca cacacacgca 
ccccaactct tcttatacgt tctaacaaca 
aaaactctgc aaagcttcct tttaagccat 
acagggttgg atttgtattt gtaacaagta 
gggatctgag gacacatggt ggatcactat 
atgatacaca gctagactca gactcagttt 
acctacagag cagcaagctc aaccttaaat 
tttgatatga atttgaagat agaaggccat 
gtctcacgcg gcatatttga gggttccaat 
ttaacacctt gcattctctt atctgaacac 
tcacaccact ttcacattag cagttatcat 
aaactgaagg cttagagagc taaattgcct 
gggttaaatc caagtctttc aattgccaag 
agaaattctt aaaggaatta agaagccatc 
agctggcttc ttgtcctggg tcttctgact 
ttgcaggccc cccatacttt cttcatcttc 
agtggttgtg tgatgggagg tgtttggtta 
cactctgacc cattgttagt atttgcctaa 
atcccaattt agggaaaaga gatctgtaat 
agttttagca ttcaggatac tccaagacat 
ttctacttca gtggaaaaag caaacacaca 
ttggttttgt tttccaggtg cgatttgaag 
aaggacgccg gaccaactac acgctccacg 
aggtaaggtc cccctttact tctgttctgc 
cagagctgag ggcctgtgag tccacctttt 
acacagattc tagacttggc tctgccacta 
ttcctctgag aatccattca ttcatttgca 
tcccaggatg cttgtgaaaa tccaaggtag 
gatgtgtata aaatgccact cccattcctg 
tgattattga tgaggttaac tatctcttcc 
atttatacaa aagtaagggt gaaaatatat 
gtttgcatat gaaaattctt agaatagcat 
accagaacct atgtgcccgg gcctctgccg 
ttcatctctc cactccagtc attctgtttg 
ttctcatcat tgagccttca tacatgtttc 
ctcctcttgc ctgccaatcc ctcctcagcc 
gcgaagcctt ctctggcctt tcttattatg 
ggccttcatc acattgctaa ttccatttgg 
acagcatggc gtagagctgg tagggtgcca 
acactgaatc ctggcttcac cacttactag 
tttgttcctt aattctctta tctagtaagt 
gggtcattga gaggattaaa tgatctaaga 
cccattgaaa gttgggtctt aatctctcag 
ataaccttgc cactagccgt atgtggctgt 
aaaaatcagt tcttcagttg cactatctac 
agtggctacc acatcaggca gcacagatgt 
ctttaaagtt ctctttgtcc actctccgtc 
cttatgtcta gaaccgcatc ttctacctca 
tgtcagttgt ctattgctgc tgtacaaatt 
tttatcatct cacaattctg taggtcagaa 
aaagtgttgg caaggtcatg tttcttcctg 
cttttccagt ttcttaaagg ccacctatat 
catattcaaa accagcaatg ttacctctct 
gctaaccaca gccttccaga gcccatctcc 



cccagacatt tatggagctg gggatttctc 196320 
ttgtaaaatt tgctcagtgc tctgggcctg 196380 
gaaggtgaca agagggtcaa cctcgaaaca 196440 
ttctctctct cactctcaaa cacacacaac 196500 
gaggagctag ccactgtcag aaaaaaatat 196560 
ttgaaggagg gaaataaaca gtgctgggtc 196620 
gacacctcac ctcactggcc ttaggcaaga 196680 
tagggaatgt ggttgcttgt acctgagctg 196740 
gagagtggtc ttggatgaca tttatcagtc 196800 
atcttggatc tccatggtca ccatttgact 196860 
gaaagtttga ttttgcccat ttgtggagaa 196920 
acaattttat tttagccaac tgagatttct 196980 
aatgggttct ccctaggtgc tttgaagttt 197040 
ttaagataaa aaaagggcag agatttttgg 197100 
gagggcactt ggctattaag gattaagctg 197160 
tattttattc cctggactca ggaataaaga 197220 
tccaagcata gcttaaggct tcattcataa 197280 
cattagtgat ggtgcaccat tttctttcct 197340 
agatgtgtat ctgagaaggg cagtggtcat 197400 
gggatggtgc agacatttca cattagtatg 1974 60 
agaaataggt aaaaacaatt gtcaggatga 197520 
gcatcccttg cctgtggcaa gggagtgaag 197580 
cattccatgg aacactcttg ggttctgaat 197640 
taaagtacat ctgagttcac tgcccaccac 197700 
gtttaacagg aaacgtgcag tttaatgaga 197760 
tgattgaaat gaaacatgac ggcatccgaa 197820 
agagagaaga ggctgagcag ggactctggc 197880 
ctggactgga tctttgaaga aactcagaca 197940 
accagctggg acattgggca agtctcgttc 198000 
aaattaagtt taaaaaaatc tctacatttg 198060 
aggaaagcac ttctaaaaca taaagtaatt 198120 
agggtttcta aactaagaac ttgagaatga 198180 
taatcgatag ttggtcatat ccactctatt 198240 
atgtttacat atatgtataa tgtataatat 198300 
tcaaatcctt ggaataacat ttcgaattct 198360 
atttcttcac ccttaaccct tgatacttcc 198420 
tctttcacct ccctgaatac accatgttct 198480 
ctctgcctgg gacttcctcc ccccatctcc 198540 
ttccacactc aaagtaaatg tcattcccca 198600 
cttcatggaa gcctctactt ctctgctcat 198660 
tggcgtagca taatgactgg gaggcattgt 198720 
tggatcagaa tgctggctct ggaatcctca 198780 
tctggggttc cggataattt atttgagctc 198840 
ggggatcacc ttaaccatgc caacctcata 198900 
gtgtgcagaa gttctaagta ctggaactgg 198960 
acttaatgct ctctatgcct gtgctgtcag 199020 
ttaggttgaa ttttaaataa attcccattt 199080 
atgtcaagtg ctcagtagcc acatgtggct 199140 
agaacattta catcactgca gaaagttcta 199200 
ctcacctccc ttaaacctgt cttctctgct 199260 
cagccaatct tatctcattc attaaggcta 199320 
accacaaaca gtgacttaaa acaacacaga 199380 
gtctgacttg agtttcactg gactaaaatc 199440 
gaggctccag agttaagttt gtttctttgc 199500 
tccttggctc ccatgttacc aagctttctc 199560 
ctgacccttc ttctatcatc acatcttctc 199620 
catctctgac tgggagatct cctccttcag 199680 
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gaatcagcta cagtaacctg ccttgactta cagcaggtac caaataaggg caccctattt 199740 
ggctatgaat gtagccactg ggaacgtatc cacttcttca gcaaactttt ggcatgttga 199800 
ttgtacacag acactctttc tgggctcaaa aacccttgag aatgacattt atccagaaat 199860 
agtccaagtc tgagttcatt tttgttgtcc catggtcttc tgatttttgc tgctggagaa 199920 
gacatttgct aggtcactta gctcttataa ctttcaattt ccacatctgt aaaacgggca 199980 
cagatcatgt ctgcattagc gcctgatcac tgttcagtga atgttacctg tgttagtggt 200040 
tattaatagc tatagtaaat gttgaagagt tagcacacgg ctgaaatctg gaaaacactg 200100 
aggtggcttg tttatttctt tcaaagcatt tagcagctac tgttaagagt actgcctctg 200160 
gaagggacac caaatatgtg accatctccc caaggccatc ttcctttcag attctttttt 200220 
tgtttttttg gagacagagc cttgctctgt cgctcagact ggagtgcaat ggcatggtct 200280 
cggctcactg caacctccac ctcccggatt caagtgattc tcctgcctca gcctcccgag 200340 
tggttgggat tataggtgcc catcaccaca cctggctaat ttttgtattt tgggtagaca 200400 
cggggtttct cacgacgttg gccagactag tctcaaactc ctgaccgtgt gatccattca 200460 
cctccgcctc ccaaagtgct gagattacag gtatgagcca ccacccccag gctttttttt 200520 
ctttctttct ttcttttttg aggcagggtc tcgctctctt gcccaggctg gagtgcagtg 200580 
gtactatcat agctcactat aacctcaaac tcctgggctc aagtgatcca cccacctcgg 200640 
cctcccaaag tgctgggatt acaagcatga gccactgtcc ccagcccctt tctttcagat 200700 
tctgagaact gctggtatca cctcctgcct tgacaatgaa aacatagatg cagcattttt 200760 
gtgtaaggcc caagcaggca agcctattgc tcctagatag aaaattgttg accatagacc 200820 
tgggcttttc ctcagaatcc tcagaatggc cttttgctct cagggaaaaa agaatattgc 200880 
tatttcctgg atagaacaaa ttggcatttc tacattgtat atgagtagtt atgtgtttgc 200940 
cataattaaa atatggatgt cgaactgttt tccccaactc tggagtagtg gattagctta 201000 
aaatatagcc agattatttt ctacacgtta gtggtcactg ggtatattgg ggatatactg 201060 
aaggtgcaac aagtaagaat agaagccaga gacttctatg ccatttctgc catcatccag 201120 
ctgtgtggcc ttcagcatgt ctcatagtct ctctgggtct ctaaaataag tgagagacat 201180 
tgggctgtgc aaaccttaat tcctctcagc tctgacacct tggatgtcag tgccatctgc 201240 
tgcattttta ccatcctctc cctactcaca ttgttcctct cctggctgga cttcggcttc 201300 
ctctgtgaaa gtagaatttc cactggggcc ttcatcaatt gacaaactgt cctttggtat 201360 
gaaattcccc agttgttact cccagaacca gaggccccag ccctttgagt tgaccctaac 201420 
tattgcccag agactgttgg gaggcttttt atctcttcaa gacttgctct tcatcaagaa 201480 
cttctcagaa gtggaggcac tgtagcaaga ttaatggttc tcaagcctca gcaaatagaa 201540 
gaattactgg gaggccttat ttaaaacgca gattcctggg tcccagcccc tgggtactct 201600 
gcttctggag aacagagatg aagcctagga atctgcacct tcttgaatcc tcctctccaa 201660 
cccacagatt ttctggttgg tgcgtaaaga gtttgataag aatcactggc aactactaag 201720 
gaaagaagat ggaagtaaaa ggaatacaaa taatataata taatataata taatttataa 201780 
tataatttac aaaaaaagaa acactgacag ggtggctaag agcttgggct ttagaggcag 201840 
acagaccagg attttaagtc tgctttagtc tttactaggc acatgacctt gaacacatta 201900 
tttagcctcc ctgcaagtca ttacatcatc tctaaaaggg tggtaatact ctttgcctcc 201960 
caggatcact gtcaggattg aataagacaa tgcctgttac accactgctt gtagtaaggg 202020 
attcggaaaa gtagctcctg gcacgtagcg gacattcagc ccacatttgt ttacttgttt 202080 
gtacttttaa actcccttct ataggccaca gggattttat atcagcattg catttggggg 202140 
agtcaggctg tgtaagcctg tggaatggat aaatcagcaa agagaaataa cagagtacct 202200 
taaaatccat gagcaaatgg ttgtgcttta gttttcagat cccagggctg tttttaaact 202260 
taccttccaa tggccaaatc ctttcttaga atgtgggctt aaacgtcctt tgtttatggg 202320 
ggatttgcct ttcagaaaat caattcctca catctgtggc ttcaggcttt ttactcaatc 202380 
agcctcctaa caaagtaatt tttctgattt tcaagggtga cctagttgta ctggtttcag 202440 
tgagcattac atggccttac ctgattgtgt atagtaaaaa ggagctgttt cttccagatt 202500 
tggagcctca gagaaactct cctgatcttc aggcctattt cctgttcacc catgtgccaa 202560 
tataccctct aggtacctgg atacaatttg acatcccact gaaatgtggg tgttctaagc 202620 
atccattcat cataaagttt aggggcttca gataccagga tgggtacttg ctctactaaa 202680 
atgggatcct taggaaatcc cgttcaagga taccacttca ctatggggta ctgatttagg 202740 
gtctctctct gaataaaata tcttgtcaaa atctggtaga atcacaaatt ctgataaatg 202800 
gctgcccaat ggagaagttt agggaaaatt atcttacatt tttcatcgaa gaactggtat 2028 60 
tgttgattta accgtctttt cccatcacca ttgttcctat agcaattttt agactgggtg 202920 
gaggaatcta cttctcattt caattcgatt tgagtggccc tgccctgccc caagtttatt 202980 
atccccactt ctgaccctgt gtataactag aatgaaaaga ctgggcagga aacaagacta 203040 
aagggacaca aagtcctgct gatatcaaaa gtaatcttct aaagtcccaa agaatttata 203100 
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gcttgccatt ccatatgctt tttggaagtt 
ccatactgaa tcatgttata gcatcagata 
ctcaccacct ctctccattt ctactgctga 
acacagtagc ctccttacca gtcacgagtc 
cttttccacc acaagacgag gacaataatt 
ttccttaaaa cttttaccac tttcctgttg 
cctcaatgct gcttccctgt ggcatcagct 
gcagttacac tggcatcttt gcattccttg 
tgcatatgct attcctttac ctggcatttt 
tccttgtcat tcaggtctca gtccaagtga 
gcacttttaa agtcactccc actcacccag 
cttttcatgg tgcttattcc ttctgatata 
ttggcaaact tctgcctact ggccaaattc 
agctaatggt tacatgtaca tttttaaatg 
tgacatgtga aaattatgag agattaaaat 
agctatgctc atttgtttat atattatcta 
aagtagtggc aacacagact atatgatcca 
ttaggataaa gcttgtcaac ccctggtctg 
cctaaatgtt ctaaaacacc aactctttaa 
attcacagtg cctagaatat actgaaacag 
ttacatcact gggaagttta tcaaatgttt 
ccatatttca aaagatctgc caataagtaa 
taacacctca agcagaactc agataaagtt 
aagtgtgtta atttagataa tcctaggtgc 
tttaatgcaa aagaagcttt tttttgttca 
tggctaagac tcaggctcat ggaccctcca 
atccagctaa cagataggaa tgaaagaaaa 
cagcctagaa gagatgcaca tcacttctgt 
ggctcaatca gaggtaaaac aagctgggaa 
accaacatct ctacaccatg gaaggaaaat 
ggccacaata gacaacttag tctttaaaaa 
aaggggctgt ggtctcagag gcagcctgga 
tgtgcaatat agatcaattg aattttcagt 
cacttaaaaa atctgatctg tcatctagat 
gtagatggaa aacacggaga tcaaaaatta 
tcagaattgt ttaaatcctc tcctgactgg 
tgatttactt ttagaaagaa tcaaaattcc 
gtattataat ccaaccattt cccaaagaaa 
tgtctgttca aaaatctttg ccagctttct 
gtctactagt ttacatgagg cataggcatg 
ggcaatattt tgttttatct ccctacaaaa 
tttctaacta ttctactagc ttttgatatg 
cacatttttt tagagatccc aatttaacaa 
ttgaaggaca actatcctta aaaattaagg 
cacagattca tcttgctttt ttttcctatc 
ttttataatt tttaaaccac ataccctgaa 
caagtgaatt tcaaaatgct ctataatgca 
aattgaaaac agaaatgtct tgtagtatag 
ctaatgaata tcatttttta gcttactgac 
tacatgcatt aactaatgta gttctcatag 
ccactctaca gatcagggac tgaggaccac 
ttggtaatta ggaaagccta gcttttaaac 
cttgaccact gggctccctc ttttttaaaa 
aggaattggt tgcatgtcaa aggaagcaac 
gtaggaatat ttaaacctgt ggactaattg 
aggaggaaga ataggaggaa ggaagggaga 
agaaggagga gaaagggaaa agaaaaaaaa 



ttgctttaaa tccttgttac ccaattttga 203160 
tagacatagc atctcagata tagcctaaac 203220 
gcccagagcc caggccatct tccacacacc 203280 
ctgctatgtt cttgcccctc tacaaggcta 203340 
ttcaaatata tatcaggtta tatggctcct 203400 
ccagtaaata tacaacatgg ccctcagagc 203460 
ctcactcctc tttctctagc tcatttctct 203520 
agcacagcct cgcactttcc tcaagtcact 203580 
ctatcactag agtttcatgt gctgtgtgcc 203640 
agccaccttg gaaacgacat ccttgaccaa 203700 
ctcacagtca caatttcctt tttatttgta 203760 
tttttattgg ttgattgatc tagattgagg 203820 
atcccaccac ctatttttgt acagcccatg 203880 
gttgtaataa atcaaaagaa gaccattttg 203940 
ttcagtgccc ataaagtttt tctggaacat 204000 
tatctgcctc tgcatgacaa caacagagat 204060 
caaagccaaa gatatctacc gtctggccct 204120 
gatatgtatt ttcttcatac ttctctacct 204180 
aagcaaggac catatttgta tttttattat 204240 
agtaagcaca tacatataaa aggttgttta 204300 
ctgtgcaatc agagagcaca ctatgctagc 204360 
gggaaaaaaa tgcctgcaaa tttaataaaa 204420 
acacataaac ttattaactt caaatctttt 204480 
tgtaacagat aaactctgaa atctcagctg 204540 
ctcagtgacc acttttttgg ctgagggttc 204600 
ctgtctactt gggtctttct ggatttcaac 204660 
tggcaactct gctggagatt tttggggcat 204720 
tgtattccgt tggccagaac ccagttactt 204780 
atgtcattgc tggccaggta gccactctcc 204840 
aagaatgctt gggagtcagc taattatctt 204900 
ttgatactaa aggatcttat tttttgaggt 204960 
gataccaatt tctgaatttg atttgatgac 205020 
tctatgatca tctcctttta agactttctc 205080 
tcattttcta cattcaaaga atctttagag 205140 
taactggccc caggagaaga cctctcactg 205200 
taggccccaa tctttaccaa ctactaactc 205260 
tgcatatcct cagtagagaa tcaaagtgct 205320 
ggaaaatttg ctctaatctc caaagtgcaa 205380 
tttcatcaaa ctgtggaacc agatttctca 205440 
aggctgagta tcactcagcc cactgaccct 205500 
gagttagaaa taaaataggt ttgtttacaa 205560 
aaagtaccta gcatataagc caagctatac 205620 
tagtgctaaa attaataagt agtgctagct 205680 
tgtctttcta agccatatac atctggctta 205740 
ttgtatgaaa accttgggat tttacggcag 205800 
cattctatac ataaagaagg gttatattgt 205860 
tgcttctttt gcagataatg ttcaacagca 205920 
aaacctgtta accttcaact aggcactacc 205980 
aatatgaacc atacactgtg ctaattgttt 206040 
cagcccatga ggcaggtacc attgttacat 206100 
agaggtcagg taacttgctc aagctgatag 206160 
caggtctggc ctgaggccag agaccaagat 206220 
taatacatct taatactcag tggcttttag 206280 
ttcctttcta ctgatggttg tgcagtttga 206340 
tgaagaagga agaagaaaga caggaaggaa 206400 
aaggatggat gacagggaag aaaatgaaca 206460 
accttaggac catctatgag gggataaatc 206520 
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atatttgaca 
gtccacaaga 
tagatcattt 
taagcctgcc 
aaaaacaaaa 
taaacatcta 
aataggtgaa 
ctctctcctc 
cttagggttc 
gagcaaattc 
aaaatgattc 
tctgtcatta 
atggtacaca 
ccgtcagacc 
tgctgcctgg 
gatgttctag 
agaaactcat 
ctaatgggaa 
ggagttggtc 
tacttcactt 
cacttcgcag 
cctgaaatgc 
ccttcttagg 
tccaggacaa 
caggccagac 
tcattagtgc 
aactgaagga 
aatgcctcag 
aatgcatatg 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnctactg 
tctgggcaga 
gtccagacca 
gtgtaagatc 
cttctgcggc 
atctctcaga 
tagctcagag 
atgaattagt 
taggcactcc 
aaaaataaag 
cctaaaaata 
atgagttcac 
acaactgcag 
tgtaaacagt 
tggggatttt 
ttctgtttac 
ctgtctctgg 
tgacacctcc 
tccctgcagc 
acatcgtcac 
gctaggccag 
ctgtgtaata 



gatattttcc 
cctgggttct 
aatctcccta 
tgcactatct 
caataccaaa 
tttttcagct 
atagataaaa 
ttcactgccc 
cctagaacac 
tgtggtctct 
ttcagtcccc 
gaaagcaatg 
tgcctgcact 
tccgagctgt 
atgcattcat 
actaagacag 
agcaccatgg 
cgtacacaag 
tccctggctc 
tccacgtctc 
gttttgtgag 
aagcattcag 
aagaacagca 
gctcaaccac 
gaaatggcat 
tggcaattat 
actgggaaag 
tttttgtgag 
cacttctagn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
cccttttggg 
ttcctgtctt 
ttcatctctc 
ccacacttgg 
catgtcatgt 
cttctttgca 
ccttgtgcac 
ataaataacc 
tatatattca 
gataaacaac 
aagtaaaaga 
tgaagtgcag 
ggaattaaat 
taagccctta 
ccctgggagc 
caatatttta 
tattgactgg 
actctcttct 
caccgatgcc 
aacaatccta 
cacaagggtt 
gataaaagca 



caggtttgta 
atgttctgcc 
agcctaagtt 
gcatagtgta 
ggagaaaact 
agaaggctcc 
agcagagctg 
ctcaccacct 
agtccttgca 
gcgcctctgc 
ttccagttct 
tctccttcca 
gagggtaagc 
ctcagcatcg 
tctgtccaca 
gaactctgga 
agagatgggg 
gtactcaggg 
atatcctggc 
gctttctgca 
gatgaaagga 
taaatgataa 
tagtggctaa 
tttgcagcta 
gaaaactgtt 
tgttgttaat 
acatgaaggt 
cctatgctga 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
ctaactctat 
catatcttgg 
atgatcataa 
cttttgcttc 
cttcccatgg 
tctcgtcaac 
aatagcagtt 
tactcaaatt 
atattgtaat 
catcatagga 
atgtaatgtt 
ggttagtaga 
cccatggtcc 
tccctcagag 
aaaattgggg 
gagctcggta 
gtgacccaga 
cattaacaga 
caagctgggg 
gtgagtactc 
ttccaccagg 
gcaagtcgaa 



ttgattttat 
actaatctac 
ttcctatctg 
aagatgttat 
ttacatcata 
ttgagaaagg 
accattagag 
tccatgagcg 
aaagcgaggg 
ttctccctgt 
gacattctag 
ccttgactaa 
ctccatgcag 
atagggtgca 
aacatttcct 
acaagtatgt 
acagacacac 
ttgtgtctgg 
tccacggcta 
tacttaaaat 
gttaaaacat 
ctaatatgat 
atgcagcctc 
cttcacctcg 
gaacacttag 
ggctgcgaga 
gaaggtgaca 
gttctggctg 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
cttccaccca 
ctcatattct 
gccctgattc 
tctgaactcc 
tagaccagag 
cctgcctgac 
caattaatat 
tgaaaaatta 
agtttatact 
agttaaatct 
taaatcatat 
catgggggaa 
tggcttgcat 
gtggcaattt 
gcacctgaaa 
tgtccaattc 
gcaggccagt 
ttggttactg 
gcgataattc 
agtccttcat 
actgaaagct 
aagggaatgc 



taaataatct 
tatataacct 
taaagcagag 
aaaagtactg 
atatactacc 
caactcttgc 
caaggatgag 
gtgaaaggcc 
aactgagacc 
gaaaaaggaa 
aaacgtgagt 
cagcatccct 
tatcctgaga 
ccatgtccat 
gagctgcttc 
ttatttcagt 
aaacagctag 
catgcagggg 
ccctgtggcc 
tgaaaggata 
aaaaagcact 
ttcaagtcca 
tggagagaga 
ggcaagttaa 
cctctgacat 
ttacaggaaa 
tttcagctga 
ttgatagacc 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
ggacccacag 
ccctaacctg 
ctcctgttgc 
ttaggcccac 
gcgttctcag 
aacaagcaac 
ttcttgaaag 
atattgtcaa 
tggagacaga 
agatggaaaa 
gattccaact 
agaggggttt 
tgtacaaaac 
ccaaactgaa 
agcactgtgc 
ttcatgagct 
tcagaggagg 
gaatgaagat 
aagtgttcag 
caaggttact 
ggcctttctt 
cctgaaaatg 



tacctttgag 
attgttttgt 
aaaataatcc 
tggagattta 
agagtgcttt 
ggggaatccc 
gaagccagag 
ccacaagtct 
acactgtctt 
aaagttggac 
ttctggttct 
cacagggaac 
gacatgctcc 
gctgggtcac 
tctgggaaaa 
cttgtccatg 
acccagtgtt 
gcacggccct 
ttgggaaagt 
attaactacc 
ttaaaaagtc 
gagcccaatc 
ttgtcagggt 
cacactccca 
agcaagagca 
aactatttcc 
ccatttggca 
tgttcatccc 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 
aacttaaaca 
aaaggttttg 
atttttccct 
tcacttgtca 
agcaagccct 
actaatgttt 
accttggaaa 
ccataatttt 
agaaattttt 
atacagttaa 
atagttcaga 
gaccctgtga 
agactggctg 
ccaattattg 
ttcttattat 
ccttacttgc 
ctggcccacc 
gataagtttg 
aacagaacat 
tgggattcaa 
cttgccaaaa 
agaggctctg 



206580 
206640 
206700 
206760 
206820 
206880 
206940 
207000 
207060 
207120 
207180 
207240 
207300 
207360 
207420 
207480 
207540 
207600 
207660 
207720 
207780 
207840 
207900 
207960 
208020 
208080 
208140 
208200 
208260 
208320 
208380 
208440 
208500 
208560 
208620 
208680 
208740 
208800 
208860 
208920 
208980 
209040 
209100 
209160 
209220 
209280 
209340 
209400 
209460 
209520 
209580 
209640 
209700 
209760 
209820 
209880 
209940 



125 



CL001205 



agttttcact 
cttactttct 
ctggttttca 
gtagctgtaa 
tttagggtct 
tcctggaaca 
aacaaagttt 
ggaagtgaat 
atgccaaagt 
ttctcttcca 
atgaccgtta 
actcctaccg 
aggcctggaa 
aattagaggg 
ccacagatgt 
caaaatccaa 
cactcagagt 
agatcaattt 
cagtatcagt 
ttgcctctct 
tactctgttc 
acctatatca 
tgaatgtgcc 
aatggtttac 
tttatagtgc 
aattacagca 
tattgagaag 
tcttttaaca 
aataaggagt 
gtggggatgt 
ggctcacgcc 
ggaatttgag 
ttagctgcac 
aatcacttga 
gcctgggaga 
aagcagccca 
tgagaggaga 
ggatacatat 
agatcacgaa 
ccttgtgcag 
cccaggtcaa 
gtttcatatc 
agaagaaaaa 
tgaaggtggt 
cacctctcac 
atctggaggc 
ctgctgtgca 
tttgcttggt 
aacacagctg 
ggtgcctcag 
ccagtaaaaa 
aggctgaggc 
aaccccatct 
cctaggaatt 
cttgcaatga 
nnnnnnnnnn 
nnnnnnnnnn 



ctgtatctct 
cttgagccag 
tctgtcctcc 
gtgaagatgc 
atttctcctt 
taatttgaaa 
gcttccttaa 
tgaggggcta 
cctccctacc 
ggaagatcct 
cgagggctac 
tctggagatt 
tggcatggtg 
cggaggcaga 
ctatgaaacc 
tttcttcaga 
cttgtgacgt 
tcttgaggca 
ttcttctctg 
agtttcatgc 
cctagagtgc 
aataatactt 
ttttaattgt 
cctgccttat 
tcaccacact 
gttttgaaaa 
caaaagattt 
ttttatgggg 
ctatattgta 
gaaacttgaa 
tataatccca 
actagcctgg 
atggtggcgg 
accaaggaga 
caagagcgaa 
ggaattttta 
atgagaaaac 
gagtaaaaaa 
ggtatttgag 
ggaagggacc 
tgttaagtgc 
agggcaagag 
gatggaataa 
tattgagttg 
tttgggaagc 
tggtttccat 
cttttcacct 
taaaagtctg 
gtggtgagtc 
cagccaggcc 
ctgttctgac 
gggtggatca 
ctactaagaa 
cgggagctga 
gccaagatgg 
nnnnnnnnnn 
nnnnnnnnnn 



tagtttcact 
ctttcccata 
ctcagtagaa 
tgtaattttc 
caagatgacc 
agctgtatgc 
cacaagaagc 
gagagccttc 
catgggtgaa 
tatgtgatgc 
tgtgtagagc 
gtcagtgatg 
ggagagctgg 
gggtttgaca 
ctgtaattga 
cactctttgt 
cagttgattg 
gactgctgag 
atattaaatg 
tctctcatga 
tcccttccac 
ttttcagggt 
tctcgccttc 
ataacggtta 
tgaaataact 
gctgctgaat 
gatcttttgc 
tagaaaagac 
tggctcagct 
aaatgaccat 
gcaccttggg 
ccaacatggt 
gcacctgtaa 
tagaggttgc 
actctgtctc 
aagtataaac 
gcatggtttt 
tagcaagagg 
tagagagaca 
accacggctt 
tcataaggac 
gagccatgat 
aggaaatctt 
gggaatggga 
tccatggaca 
gggaactccc 
tctctagcat 
cagtgtgggc 
actggcacag 
tgcctcagta 
caggtgcggt 
cgaggtcagg 
tacaaaaaat 
gtagttaagg 
tgnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



gtctcatcct 
tttaaaacat 
gctattttca 
agtgactgat 
ccagaggtat 
catatctcag 
ttgtaatggg 
cccagtgtta 
cccataaccc 
tcaagaagaa 
tggcggcaga 
gaaaatacgg 
tctatggagt 
ggaaatcatt 
gtgttgttgc 
tcaggttact 
tcgtccaaca 
ttgtctatac 
catctggagc 
aatttccaat 
tctccacccc 
gtgtttcttt 
ccctatagaa 
cctgtgtaac 
ccatgcacaa 
gtcccacatt 
ttaatacttt 
aatgtgagaa 
aagggctaac 
atttaaacag 
aggccgaggc 
gaaaccccat 
tcccagctcc 
agtgagccaa 
aaaaatgata 
aataggatgg 
tgttgaggac 
aagagtgtgg 
acaaaaggca 
cactcataga 
aagcagccac 
ggccctgaga 
cctcagggag 
tgtgcttctc 
aaaggtagag 
atcaagctct 
ttatcttcac 
agaacaatgc 
cagggctcag 
gcctcctcct 
ggctcatgcc 
agatctagac 
tagccgggca 
caggagaatg 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



ttgtaccctt 
aagaatgaca 
aagaaagtcc 
catgagaaac 
ttccaaaaaa 
tgataaaaca 
gtagggagtg 
accaaacatg 
acattctgct 
cgccaatcag 
gattgccaag 
agcccgagac 
aagttcactg 
tggtggttgg 
tgctgaacag 
ggtcccaggt 
caggtgacag 
aaagtcactt 
caacctaact 
tcagtcaaat 
taagatacta 
ctttcttctc 
tttagttgct 
aggggtagga 
ttgctataaa 
gttttaagca 
atgaggtaga 
aatagtaaaa 
tagagtttgt 
accgagaggc 
aggtggatca 
ctctaccaaa 
ttgggaggct 
gaccatgcca 
ataaaaaaca 
tggtggaaat 
ttgggttgaa 
atgattcagt 
atggaagaag 
aactaaaaca 
tgaggaagat 
gacatacaaa 
gtagcttttg 
tcctttctta 
acctggatag 
taaggtcatc 
cttcacctca 
caggatcatc 
gccagctgga 
tatcaggcct 
tgtaatccca 
catcctggct 
tggtagtggg 
gcgtgaactg 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



gggtaactta 
ctaaattgtg 
aatacataaa 
ctggagcctg 
atacaaaata 
ttcttatttt 
aagacattta 
atattatgct 
atctccccat 
tttgagggca 
cacgtgggct 
cctgacacga 
cagggtggga 
gtggccctgc 
atgagtcatc 
ccctcaatcc 
catagctcca 
gtggctctct 
ttctagttac 
gccccttaat 
ctccttcaaa 
ataataggta 
ggtttttttt 
ctattctatc 
atcttcaata 
tttggggaat 
aaagacccca 
ggtagtatag 
ctagccctca 
cgggcacagt 
cctgaggtca 
aatacaaaaa 
gaggcaggag 
tcgcactcca 
tatgagagaa 
aagtaaggtg 
gagtagtcag 
gggcattggg 
cctccagctg 
catgccaagc 
gataaatctt 
gggaattcaa 
agatgacatg 
ttccctttac 
tacatgctgc 
cccatggcaa 
tgtaatgtgg 
cccagtagcc 
gttacctatg 
attcctctgc 
gcactttggg 
aacacagtga 
cgcctgtagt 
ggaggcggag 
nnnnnnnnnn 
nnnnnnnnnn 
nnnnnnnnnn 



210000 
210060 
210120 
210180 
210240 
210300 
210360 
210420 
210480 
210540 
210600 
210660 
210720 
210780 
210840 
210900 
210960 
211020 
211080 
211140 
211200 
211260 
211320 
211380 
211440 
211500 
211560 
211620 
211680 
211740 
211800 
211860 
211920 
211980 
212040 
212100 
212160 
212220 
212280 
212340 
212400 
212460 
212520 
212580 
212640 
212700 
212760 
212820 
212880 
212940 
213000 
213060 
213120 
213180 
213240 
213300 
213360 
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nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 213420 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnn 213456 

<210> 4 
<211> 484 
<212> PRT 
<213> Human 

<400> 4 

Met Gin His He Phe Ala Phe Phe Cys Thr Gly Phe Leu Gly Ala Val 

15 10 15 

Val Gly Ala Asn Phe Pro Asn Asn He Gin He Gly Gly Leu Phe Pro 

20 25 30 

Asn Gin Gin Ser Gin Glu His Ala Ala Phe Arg Phe Ala Leu Ser Gin 

35 40 45 

Leu Thr Glu Pro Pro Lys Leu Leu Pro Gin He Asp He Val Asn He 

50 55 60 

Ser Asp Ser Phe Glu Met Thr Tyr Arg Phe Cys Ser Gin Phe Ser Lys 
65 70 75 80 

Gly Val Tyr Ala He Phe Gly Phe Tyr Glu Arg Arg Thr Val Asn Met 

85 90 95 

Leu Thr Ser Phe Cys Gly Ala Leu His Val Cys Phe He Thr Pro Ser 

100 105 110 

Phe Pro Val Asp Thr Ser Asn Gin Phe Val Leu Gin Leu Arg Pro Glu 

115 120 125 

Leu Gin Asp Ala Leu He Ser He He Asp His Tyr Lys Trp Gin Lys 

130 135 140 

Phe Val Tyr He Tyr Asp Ala Asp Arg Gly Leu Ser Val Leu Gin Lys 
145 150 155 160 

Val Leu Asp Thr Ala Ala Glu Lys Asn Trp Gin Val Thr Ala Val Asn 

165 170 175 

He Leu Thr Thr Thr Glu Glu Gly Tyr Arg Met Leu Phe Gin Asp Leu 

180 185 190 

Glu Lys Lys Lys Glu Arg Leu Val Val Val Asp Cys Glu Ser Glu Arg 

195 200 205 

Leu Asn Ala He Leu Gly Gin He He Lys Leu Glu Lys Asn Gly He 

210 215 220 

Gly Tyr His Tyr lie Leu Ala Asn Leu Gly Phe Met Asp He Asp Leu 
225 230 235 240 

Asn Lys Phe Lys Glu Ser Gly Ala Asn Val Thr Gly Phe Gin Leu Val 

245 250 255 

Asn Tyr Thr Asp Thr He Pro Ala Lys He Met Gin Gin Trp Lys Asn 

260 265 270 

Ser Asp Ala Arg Asp His Thr Arg Val Asp Trp Lys Arg Pro Lys Tyr 

275 280 285 

Thr Ser Ala Leu Thr Tyr Asp Gly Val Lys Val Met Ala Glu Ala Phe 

290 295 300 

Gin Ser Leu Arg Arg Gin Arg He Asp He Ser Arg Arg Gly Asn Ala 
305 310 315 320 

Gly Asp Cys Leu Ala Asn Pro Ala Val Pro Trp Gly Gin Gly He Asp 

325 330 335 

He Gin Arg Ala Leu Gin Gin Val Arg Phe Glu Gly Leu Thr Gly Asn 

340 345 350 

Val Gin Phe Asn Glu Lys Gly Arg Arg Thr Asn Tyr Thr Leu His Val 

355 360 365 

He Glu Met Lys His Asp Gly He Arg Lys He Gly Tyr Trp Asn Glu 
370 375 380 
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Asp Asp Lys Phe 
385 

Asn Ser Ser Val 

Asp Pro Tyr Val 
420 

Asp Arg Tyr Glu 
435 

His Val Gly Tyr 
450 

Gly Ala Arg Asp 
465 

Leu Val Tyr Gly 



Val Pro Ala Ala 
390 

Gin Asn Arg Thr 
405 

Met Leu Lys Lys 

Gly Tyr Cys Val 
440 

Ser Tyr Arg Leu 
455 

Pro Asp Thr Lys 
470 



Thr Asp Ala Gin 
395 

Tyr lie Val Thr 
410 

Asn Ala Asn Gin 
425 

Glu Leu Ala Ala 

Glu He Val Ser 
460 

Ala Trp Asn Gly 
475 



Ala Gly Gly Asp 
400 

Thr He Leu Glu 
415 

Phe Glu Gly Asn 
430 

Glu He Ala Lys 
445 

Asp Gly Lys Tyr 

Met Val Gly Glu 
480 
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